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Lepidopteran-Active Bacillus thuringiensis 
5-Endotoxin Compositions and Methods of Use 

1 .0 Background of the Invention 

1.1 Field of the Invention 

The present invention relates generally to the fields of molecular biology. More 
particularly, certain embodiments concern methods and compositions comprising DNA 
segments, and proteins derived from bacterial species. More particularly, it concerns novel 
genes from Bacillus thuringiensis encoding lepidopteran-toxic crystal proteins. Various methods 
for making and using these DNA segments, DNA segments encoding synthetically-modified Cry 
proteins, and native and synthetic crystal proteins are disclosed, such as, for example, the use of 
DNA segments as diagnostic probes and templates for protein production, and the use of 
proteins, fusion protein carriers and peptides in various immunological and diagnostic 
applications. Also disclosed are mediods of making and using nucleic acid segments in the 
development of transgenic plant cells containing the DNA segments disclosed herein. 

1.2 Description of the Related Art 

Almost all field crops, plants, and commercial farming areas are susceptible to attack by 
one or more insect pests. Particularly problematic are Coleopteran and Lepidoptem pests. For 
example, vegetable and cole crops such as artichokes, kohlrabi, arugula, leeks, asparagus, lentils, 
beans, lettuce (e.g., head, leaf, romaine), beets, bok choy, malanga, broccoli, melons (e.g., 
muskmelon, watermelon, crenshaw, honeydew,cantaloupe), brussels sprouts, cabbage, cardoni, 
carrots, napa, cauliflower, okra, onions, celery, parsley, chick peas, parsnips, chicory, peas, 
Chinese cabbage, peppers, collards, potatoes, cucumber, pumpkins, cucurbits, radishes, dry bulb 
onions, rutabaga, eggplant, salsify, escarole, shallots, endive, soybean, garlic, spinach, green 
onions, squash, greens, sugar beets, sweet potatoes, turnip, swiss chard, horseradish, tomatoes, 
kale, turnips, and a variety of spices are sensitive to infestation by one or more of the following 
insect pests: alfalfa looper, armyworra, beet armyworm, artichoke plume moth, cabbage 
budworm, cabbage looper, cabbage webworm, com earworm, celery leafeater, cross-striped 
cabbageworm, european com borer, diamondback moth, green cloverworm, unported 
cabbageworm, melonworm, onmivorous leafroUer, pickleworm, rindworm complex, saltmarsh 
caterpillar, soybean looper, tobacco budworm, tomato fruitworm, tomato homworm, tomato 
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pinworm, velvetbean caterpillar, and yellowstriped armyworm. Likewise, pasture and hay crops 
such as alfelfa, pasture grasses and silage are often attacked by such pests as armyworm, beef 
armyworm, alfelfa caterpillar, European skipper, a variety of loopers and webworms, as well as 

yellowstriped armyworms. 

Fruit and vine crops such as apples, apricots, cherries, nectarines, peaches, pears, plums, 
prunes, quince almonds, chestouts, filberts, pecans, pistachios, walnuts, citrus, blackberries, 
blueberries, boysenberries. cranberries, currants, loganberries, raspberries, strawberries, grapes, 
avocados, bananas, kiwi, persimmons, pomegranate, pineapple, tropical fruits are often 
susceptible to attack and defoliation by achema sphinx moth, amorbia, armyworm. citrus 
cutworm, banana skipper, blackheaded fiieworm. blueberry leafioller, cankerworm. cherry 
fruitworm. citrus cutworm, cranberry girdler, eastern tent caterpillar, fall webworai, fall 
webworm, filbert leafroller, filbert webworm, fiiiit tree leafiroUer. grape berry moth, grape 
leafFolder, grapeleaf skeletonizer, green fruitworm, gummosos-batrachedra conunosae, gypsy 
moth, hickory shuckworm, homworms, loopers. navel orangeworm. obliquebanded leafiroUer, 
omnivorous leafroller. omnivorous looper, orange tortrix, orangedog, oriental fruit moth, 
pandemis leafroller, peach twig borer, pecan nut casebearer, redbanded leafi^ller. redhumped 
caterpillar, roughskinned cutworm, saltmarsh caterpillar, spanworm, tent caterpillar, thecla- 
thecla basiUides. tobacco budworm, tortrix moth, tufted apple budmoth, variegated leafroUer, 
walnut caterpillar, western tent caterpillar, and yellowstriped armyworm. 

Field crops such as canola/rape seed, evening primrose, meadow foam, com (field, sweet, 
popcorn), cotton, hops, jojoba, peanuts, rice, safflower, small grains (barley, oats, rye, wheat, 
etc.), sorghum, soybeans, sunflowers, and tobacco are often targets for infestation by insects 
including armyworm. asian and other com borers, banded sunflower moth, beet armyworm, 
bollworm. cabbage looper. com rootworm (including southern and western varieties), cotton leaf 
perforator, diamondback moth, european com borer, green cloverworm, headmoth. headworm, 
imported cabbageworm. loopers (including Anacamptodes spp.). obliquebanded leafiroller, 
omnivorous leaftier. podworm. podworm. saltmarsh caterpillar, southwestem com borer, 
soybean looper, spotted cutworm, sunflower moth, tobacco budworm, tobacco homworm, 
velvetbean caterpillar. 

Bedding plants, flowers, ornamentals, vegetables and container stock are frequently fed 
upon by a host of insect pests such as armyworm, azalea moth, beet armyworm, diamondback 
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moth, ello moth (homworm), Florida fern caterpillar, lo moth, loopers, oleander moth, 
omnivorous leafroller, omnivorous looper, and tobacco budworm. 

Forests, fruit, ornamental, and nut-bearing trees, as well as shrubs and other nursery stock 
are often susceptible to attack from diverse insects such as bagworm, blackheaded budworm, 
browntail moth, California oakworm, douglas fir tussock moth, elm spanworm, fall webworm, 
fruittree leafroller, greenstriped mapleworm, gypsy moth, jack pine budworm, mimosa 
webwom, pine butterfly, redhumped caterpillar, saddleback caterpillar, saddle prominent 
caterpillar, spring and fall cankerworm, spruce budworm, tent caterpillar, tortrix, and western 
tussock moth. Likewise, turf grasses are often attacked by pests such as armyworm, sod 
webworm, and tropical sod webworm. 

Because crops of commercial interest are often the target of insect attack, 
environmentally-sensitive methods for controlling or eradicating insect infestation are desirable 
in many instances. This is particularly true for fanners, nurserymen, growers, and conunercial 
and residential areas which seek to control insect populations using eco-friendly compositions. 

The most widely used environmentally-sensitive insecticidal formulations developed in 
recent years have been composed of microbial pesticides derived from the bacterium Bacillus 
thuringiensis. B, thuringiensis is a Gram-positive bacterium that produces crystal proteins or 
inclusion bodies which are specifically toxic to certain orders and species of insects. Many 
different strains of B. thuringiensis have been shown to produce insecticidal crystal proteins. 
Compositions including B, thuringiensis strains which produce insecticidal proteins have been 
commercially-available and used as environmentally-acceptable insecticides because they are 
quite toxic to the specific target insect, but are harmless to plants and other non-targeted 
organisms. 

1.2.1 B. thuringiensis Crystal Proteins S-Endotoxins 

5-endotoxins are used to control a wide range of leaf-eating caterpillars and beetles, as 
well as mosquitoes. These proteinaceous parasporal crystals, also referred to as insecticidal 
crystal proteins, crystal proteins, Bt inclusions, crystaline inclusions, inclusion bodies, and Bt 
toxins, are a large collection of insecticidal proteins produced by B. thuringiensis that are toxic 
upon ingestion by a susceptible insect host. Over the past decade research on the structure and 
fimction of 5. thuringiensis toxins has covered all of the major toxin categories, and while these 
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toxins differ in specific strocture and function, general similarities in the structure and function 
are assumed. Based on the accumulated knowledge of B. thuringiensis toxins, a generalized 
mode of action for B. thuringiensis toxins has been created and includes: ingestion by the insect, 
solubilization in the insect midgut (a combination stomach and small intestine), resistance to 
digestive enzymes sometimes with partial digestion actually "activating" die toxin, binding to tiie 
midgut cells, formation of a pore in the insect ceils and tiie disruption of cellular homeostasis 

(English and Slatin, 1992). 

One of the unique features of 5. thuringiensis is its production of crystal proteins during 
sporulation which are specifically toxic to certain orders and species of insects. Many different 
strains of B. thuringiensis have been shown to produce insecticidal crystal proteins. 
Compositions including B. thuringiensis strains which produce proteins having insecticidal 
activity against lepidopteian and dipteran insects have been commercially available and used as 
environmentally-acceptable insecticides because they are quite toxic to tiie specific target insect, 
but are harmless to plants and other non-targeted organisms. 

The mechanism of insecticidal activity of tfie B. thuringiensis crystal proteins has been 
studied extensively in the past decade. It has been shown that tiie crystal proteins are toxic to tiie 
insect only after ingestion of the protein by tiie insect. The alkaline pH and proteolytic enzymes 
in the insect mid-gut solubilize tiie proteins, tiiereby allowing tiie release of components which 
are toxic to tiie insect. These toxic components disrupt the mid-gut cells, cause tiie insect to 
cease feeding, and, eventually, bring about insect deatii. For this reason, B. thuringiensis has 
proven to be an effective and environmentally safe insecticide in dealing witii various insect 
pests. 

As noted by Hofte and Whiteley (1989), tfie majority of insecticidal B. thuringiensis 
strains are active against insects of the order Lepidoptera, ie., caterpillar insects. Other B. 
thuringiensis strains are insecticidally active against insects of the order Diptera, i.e., flies and 
mosquitoes, or against botii lepidopteran and dipteran insects. In recent years, a few B. 
thuringiensis strains have been reported as producing crystal proteins tiiat are toxic to insects of 
tfie order Coleoptera. i.e., beeties (Krieg et al., 1983; Sick et a!., 1990; Donovan et al., 1992; 
Lambert era/., 1992a; 1992b). 
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1.2.2 Genes Encoding Crystal Proteins 

Many of the 5-endotoxins are related to various degrees by similarities in their amino 
acid sequences. Historically, the proteins and the genes which encode them were classified 
based largely upon their spectrum of insecticidal activity. The review by Hofte and Whiteley 
(1989) discusses the genes and proteins that were identified in B. thuringiensis prior to 1990, and 
sets forth the nomenclature and classification scheme which has traditionally been applied to B, 
thuringiensis genes and proteins, cryl genes encode lepidopteran-toxic Cryl proteins, cryll 
genes encode Cryll proteins that are toxic to both lepidopterans and dipterans. crylll genes 
encode coleopteran-toxic Crylll proteins, while crylV genes encode dipteran-toxic CrylV 
proteins. Based on the degree of sequence similarity, the proteins were fiirther classified into 
subfamilies; more highly related proteins within each family were assigned divisional letters 
such as CrylA, CrylB, CrylC, etc. Even more closely related proteins within each division were 
given names such as CrylCl, CryIC2, etc. 

Recently, a new nomenclature was developed which systematically classified the Cry 
proteins based upon amino acid sequence homology rather than upon insect target specificities 
(Crickmore et al, 1998). The classification scheme for many known toxins, including allelic 
variations in individual proteins, is summarized and regularly updated at 
http://www.biols.susx,ac.uk/Home/Neil_Crickmore/Bt/. The informationw was most recently 
updated as of April 27, 1999 and is herein incorporated by reference. 

1.2.3 Crystal Proteins Toxic to Lepidopteran Insects 

2.0 Summary of the Invention 

The recent review by Schnepf et al (1998) describes the enormous diversity of 
insecticidal crystal proteins derived fi^m B, thuringiensis. Cry proteins of the Cryl, Cry2, and 
Cry9 classes are particularly known for their toxicity towards lepidopteran larvae, however, the 
degree of toxicity varies significantly depending on the target lepidopteran pest (Hofte and 
Whiteley, 1989). For instance, Cryl Ac shows poor toxicity towards the armyworm, Spodoptera 
littoralis, but strong toxicity towards the tobacco budworm, Heliothis virescens. In addition, 
slight variations in amino acid sequence within a Cry protein class can dramatically impact 
insecticidal activity (see Schnepf et al, 1998 and references therein). The Cry3Ba and Cry3Bb 
genes, for instance, share 94% amino acid sequence identity, but only Cry3Bb exhibits 
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significant toxicity towards the Southern com rootwom, Diabrotica undecimpunctata howardi 
(Donovan et al, 1992). Similarly, Cry2Aa and Cry2Ab share 87% amino acid sequence identity, 
yet only Cry2Aa displays toxicity towards mosquitos (Widner and Whiteley, 1990). Von 
Tersch et al (1991) demonstrated that CrylAc proteins varying by only seven amino acids 
(>99% sequence identity) nevertheless show significant differences in insecticidal activity. Lee 
et al (1996) reported that CrylAb alleles differing at only two amino acid positions exhibited a 
10-fold difference in toxicity towards the gypsy moth, Lymantria dispar. Thus, even Cry 
proteins that are considered to be alleles of known Cry proteins or to belong to a Cry protein 
subclass (Crickmore et al, 1998) may have unique and useful insecticidal properties. 
International Patent Application Publication No. WO 98/00546 and WO 98/40490 describe a 
variety of Cryl-, Cry2-, and Cry9-related crystal proteins obtained from B. thurmgiensis. 

2.1 Cty DNA Segments 

The present invention concerns nucleic acid segments, that can be isolated from virtually 
any source, that are free firom total genomic DNA and that encode the novel peptides disclosed 
herein. Nucleic acid segments encoding these polypeptides may encode active proteins, peptides 
or peptide fragments, polypeptide subunits, functional domains, or the like of one or more crystal 
proteins. In addition the invention encompasses nucleic acid segments which may be 
synthesized entirely in vitro using methods that are well-known to those of skill in the art which 
encode the novel Cry polypeptides, peptides, peptide fragments, subunits, or functional domains 
disclosed herein. 

As used herein, the term "nucleic acid segment" refers to a polynucleotide molecule that 
has been isolated free of total genomic DNA of a particular species. Therefore, a nucleic acid 
segment encoding an endotoxin polypeptide refers to a nucleic acid segment that comprises one 
or more crystal protein-encoding sequences yet is isolated away from, or purified free from, total 
genomic DNA of the species from which the nucleic acid segment is obtained, which in the 
instant case is the genome of the Gram-positive bacterial genus. Bacillus^ and in particular, the 
species of Bacillus known as B. thuringiensis. Included within the term "nucleic acid segment", 
are polynucleotide segments and smaller fragments of such segments, and also recombinant 
vectors, including, for example, plasmids, cosmids, phagemids, phages, viruses, and the like. 

Similarly, a DNA segment comprising an isolated or purified crystal protein-encoding 
gene refers to a DNA segment which may include in addition to peptide encoding sequences. 
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certain other elements such as, regulatory sequences, isolated substantially away from other 
naturally occurring genes or protein-encoding sequences. In this respect, the term "gene" is used 
for simplicity to refer to a functional protein-, polypeptide- or peptide-encoding unit. As will be 
understood by those in the art, this functional term includes both genomic sequences, operon 
sequences and smaller engineered gene segments that express, or may be adapted to express, 
proteins, polypeptides or peptides. Also, the term includes an expression cassette comprising at 
least a promoter operably linked to one or more protein coding sequences, operably linked to at 
least a transcriptional termination sequence. 

"Isolated substantially away from other coding sequences" means that the gene of 
interest, in this case, a nucleic acid segment or gene encoding all or part of a bacterial 
insecticidal crystal protein, forms the significant part of the coding region of the DNA segment, 
and that the DNA segment does not contain large portions of naturally-occurring coding DNA, 
such as large chromosomal fragments or other functional nucleic acid segments or genes or 
operon coding regions. Of course, this refers to the DNA segment as originally isolated, and 
does not exclude genes, recombinant genes, synthetic linkers, or coding regions later added to 
the segment by the hand of man. 

In particular embodiments, the invention concerns isolated DNA segments and 
recombinant vectors incorporating DNA sequences that encode a Cry peptide species that 
includes within its amino acid sequence an amino acid sequence essentially as set forth in SEQ 
ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID N0:12, SEQ 
ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20. SEQ ID NO:22, SEQ ID NO:24, 
SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID 
NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ 
ID NO:48, SEQ ID NO:50 and SEQ ID NO. 63. 

The term "a sequence essentially as set forth m SEQ ID NO:2, SEQ ID NO:4, or SEQ ID 
NO:6," for example, means that the sequence substantially corresponds to a portion of the 
sequence of SEQ ID N0:2, SEQ ID NO:4, or SEQ ID N0:6 and has relatively few amino acids 
that are not identitical with, or a biologically functional equivalent of, the amino acids of any of 
these sequences. The term "biologically functional equivalent" is well understood in the art and 
is further defined in detail herein {e.g., see Illustrative Embodiments). Accordingly, sequences 
that have from about 70% to about 80%, or more preferably about 81, 82, 83, 84, 85, 86, 87, 88, 
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89. or about 90%, or even more preferably about 91, 92, 93. 94, 95, 96, 97, 98, or about 99% 
amino acid sequence identity or functional equivalence to the amino acids of SEQ ID NO:2, SEQ 
ID N0:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID N0:12, SEQ ID NO:14, SEQ 
ID N0:16, SEQ ID N0:18, SEQ ID NO:20, SEQ ID NO:22. SEQ ID NO:24, SEQ ID NO:26, 
SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36. SEQ ID 
NO:38. SEQ ID NO:40. SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ 
ID NO:50 and SEQ ID NO. 63 will be sequences that are "essentially as set forth in SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:6. SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12. SEQ ID 
NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ 
ID NO:26. SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34. SEQ ID NO:36. 
SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44. SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO:50 and SEQ ID NO: 63." 

In addition, sequences that have ftom about 70% to about 80%, or more preferably about 
81, 82, 83, 84, 85, 86, 87, 88, 89, or about 90%, or even more preferably about 91, 92, 93, 94. 
95, 96, 97, 98, or about 99% nucleic acid sequence identity or functional equivalence to the 
nucleic acids of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID N0:19, SEQ ID 
NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ 
ID NO:33, SEQ ID NO:35. SEQ ID NO:37, SEQ ID NO:39, SEQ ID N0:41, SEQ ID NO:43, 
SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62 will be sequences that are 
"essentiaUy as set forth in SEQ ID NO:l, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, SEQ ID 
NO:9. SEQ ID N0:11, SEQ ID N0:13, SEQ ID N0:15, SEQ ID NO:17, SEQ ID NO:19, SEQ 
ID N0:21, SEQ ID NO:23, SEQ ID NO:25. SEQ ID N0.27, SEQ ID NO:29, SEQ ID NO:31, 
SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39. SEQ ID N0.41, SEQ ID 
NO:43, SEQ ID NO:45, SEQ ID N0:47, SEQ ID N0:49 and SEQ ID NO:62." 

It will also be understood that amino acid and nucleic acid sequences may include 
additional residues, such as additional N- or C-terminal amino acids or 5' or 3' sequences, and 
yet still be essentially as set forth in one of the sequences disclosed herein, so long as the 
sequence meets the criteria set forth above, including the maintenance of biological protein 
activity where protein expression is concerned. The addition of tenninal sequences particularly 
applies to nucleic acid sequences that may, for example, include various non-coding sequences 
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flanking either of the 5' or 3' portions of the coding region or may include various internal 
sequences, i.e., introns, which are known to occur within genes. 

The nucleic acid segments of the present invention, regardless of the length of the coding 
sequence itself, may be combined with other DNA sequences, such as promoters, 
polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding 
segments, and the like, such that their overall length may vary considerably. It is therefore 
contemplated that a nucleic acid fragment of ahnost any length may be employed, with the total 
length preferably being limited by the ease of preparation and use in the intended recombinant 
DNA protocol. For example, nucleic acid fragments may be prepared that include a short 
contiguous stretch encoding any of the peptide sequences disclosed in SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NOilO, SEQ ID NO:12, SEQ ID NO:14, SEQ ID 
NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ 
ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, 
SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 
NO:50 and SEQ ID NO: 63, or that are identical with or complementary to DNA sequences 
which encode any of the peptides disclosed in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID 
NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ 
ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, 
SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID Na48, SEQ ID NO:50 and SEQ ID 
NO: 63, and particularly those DNA segments disclosed in SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID N0:9, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO:15, SEQ 
ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID 
NO:39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and 
SEQ ID NO:62. For example, DNA sequences such as about 18 nucleotides, and that are up to 
about 10,000, about 5,000, about 3,000, about 2,000, about 1,000, about 500, about 200, about 
100, about SO^and about 14 base pairs in length (including all intermediate lengths) are also 
contemplated to be useful. 

It will be readily understood that "intermediate lengths", in these contexts, means any 
length between the quoted ranges, such as 18, 19, 20, 21, 22, 23, etc.\ 30, 31, 32, etc,\ 50, 51, 52, 
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ss, e/c; 100, 101, 102, 103, etc.; 150, 151, 152, 153, etc.; including all integers in the ranges of 
from about 200-500; 500-1,000; 1,000-2,000; 2,000-3,000; 3,000-5,000; and up to and including 
sequences of about 10,00 or so nucleotides and the like. 

It will also be understood that this invention is not limited to the particular nucleic acid 
sequences which encode peptides of the present invention, or which encode the amino acid 
sequences of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ 
ID N0:12, SEQ ID N0:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, 
SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ 
ID NO:46, SEQ ID NO:48, SEQ ID NO:50 and SEQ ID NO: 63, including those DNA 
sequences which are particularly disclosed in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID N0:7, SEQ ID NO:9, SEQ ID N0:1 1, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID N0:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, 
SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID 
N0:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62. 
Recombinant vectors and isolated DNA segments may therefore variously include the peptide- 
coding regions themselves, coding regions bearing selected alterations or modifications in the 
basic coding region, or they may encode larger polypeptides that nevertheless include these 
peptide-coding regions or may encode biologically functional equivalent proteins or peptides that 
have variant amino acids sequences. 

The DNA segments of the present invention encompass biologically-functional, 
equivalent peptides. Such sequences may arise as a consequence of codon degeneracy and 
functional equivalency that are known to occur naturally within nucleic acid sequences and the 
proteins thus encoded. Alternatively, functionally-equivalent proteins or peptides may be 
created via the application of recombinant DNA technology, in which changes in the protein 
structure may be engineered, based on considerations of the properties of the amino acids being 
exchanged. Changes designed by man may be introduced through the application of site-directed 
mutagenesis techniques, e,g., to introduce improvements to the antigenicity of the protein or to 
test mutants in order to examine activity at the molecular level. 

If desired, one may also prepare fusion proteins and peptides, e.g., where the peptide- 
coding regions are aligned within the same expression xmit with other proteins or peptides having 
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desired functions, such as for purification or inununodetection purposes (e.g., proteins that may 
be purified by affinity chromatography and enzyme label coding regions, respectively). 

Recombinant vectors form further aspects of the present invention. Particularly useful 
vectors are contemplated to be those vectors in which the coding portion of the DNA segment, 
whether encoding a full length protein or smaller peptide, is positioned imder the control of a 
promoter. The promoter may be in the form of the promoter that is naturally associated with a 
gene encoding peptides of the present invention, as may be obtained by isolating the 5' non- 
coding sequences located upstream of the coding segment or exon, for example, using 
recombinant cloning and/or PGR™ technology, in connection with the compositions disclosed 
herein, 

2.2 Cry DNA Segments as Hybridization Probes And Primers 

In addition to their use in directing the expression of crystal proteins or peptides of the 
present invention, the nucleic acid sequences contemplated herein also have a variety of other 
uses. For example, they also have utility as probes or primers in nucleic acid hybridization 
embodiments. As such, it is contemplated that nucleic acid segments that comprise a sequence 
region that consists of at least a 14 nucleotide long contiguous sequence that has the same 
sequence as, or is complementary to, a 14 nucleotide long contiguous DNA segment of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID 
NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ 
ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35, 
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID 
NO:47, SEQ ID NO:49 and SEQ ID NO:62 will find particular utility. Longer contiguous 
identical or complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000, 
2000, 5000 bp, etc, (including all intermediate lengths and up to and including the full-length 
gene sequences encoding each polypeptide will also be of use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to crystal protein- 
encoding sequences will enable them to be of use in detecting the presence of complementary 
sequences in a given sample. However, other uses are envisioned, including the use of the 
sequence information for the preparation of mutant species primers, or primers for use in 
preparing other genetic constructions. 
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Nucleic acid molecules having sequence regions consisting of contiguous nucleotide 
stretches of about 14 to about 17 or so, 18-25, 26-35, 36-50, or even up to and including 
sequences of about 100-200 nucleotides or so, identical or complementary to DNA sequences of 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID N0:7, SEQ ID N0:9, SEQ ID NO:ll, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ 
ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, 
SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62, are particularly contemplated as 
hybridization probes for use in, e.g.. Southern and Northern blotting. Smaller fragments will 
generally find use in hybridization embodiments, wherein the length of the contiguous 
complementary region may be varied, such as between about 10-14 and about 100 to 200 or so 
nucleotides, but larger contiguous complementarity stretches may be used, according to the 
length complementary sequences one wishes to detect. 

Of course, fragments may also be obtained by other techniques such as, e.g., by 
mechanical shearing or by restriction enzyme digestion. Small nucleic acid segments or 
fragments may be readily prepared by, for example, directly synthesizing the fragment by 
chemical means, as is commonly practiced using an automated oligonucleotide synthesizer. 
Also, fragments may be obtained by application of nucleic acid reproduction technology, such as 
the PGR™ technology of U. S. Patents 4,683,195 and 4,683,202 (each incorporated herein by 
reference), by introducing selected sequences into recombinant vectors for recombinant 
production, and by other recombinant DNA techniques generally known to those of skill in the 
art of molecular biology. 

Accordingly, the nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of DNA fragments. Depending 
on the application envisioned, one will desire to employ varying conditions of hybridization to 
achieve varying degrees of selectivity of probe towards target sequence. For applications 
requiring high selectivity, one will typically desire to employ relatively stringent conditions to 
form the hybrids, e,g., one will select relatively low salt and/or high temperature conditions, such 
as provided by about 0.02 M to about 0.1 5 M NaCl at temperatures of about 50°C to about 70''C. 
Such selective conditions tolerate little, if any, mismatch between the probe and the template or 
target strand, and would be particularly suitable for isolating crystal protein-encoding DNA 
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segments. Detection of DNA segments via hybridization is well-known to those of skill in the 
art, and the teachings of U. S. Patents 4,965,188 and 5,176,995 (each incorporated herein by 
reference) are exemplary of the methods of hybridization analyses. Teachings such as those 
found in the texts of Maloy et al, 1990; Maloy 1994; Segal, 1976; Prokop, 1991; and Kuby, 
1 991 , arc particularly relevant. 

Of course, for some applications, for example, where one desires to prepare mutants 
employing a mutant primer strand hybridized to an underlying template or where one seeks to 
isolate crystal protein-encoding sequences from related species, functional equivalents, or the 
like, less stringent hybridization conditions will typically be needed in order to allow formation 
of the heteroduplex. In these circumstances, one may desire to employ conditions such as about 
0,15 M to about 0.9 M salt, at temperatures ranging from about 20*^0 to about 55°C. Cross- 
hybridizing species can thereby be readily identified as positively hybridizing signals with 
respect to control hybridizations. In any case, it is generally appreciated that conditions can be 
rendered more stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, hybridization 
conditions can be readily manipulated, and thus will generally be a method of choice depending 
on the desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences of the 
present invention in combination with an appropriate means, such as a label, for determining 
hybridization. A wide variety of appropriate indicator means are known in the art, including 
fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, which are capable of 
giving a detectable signal. In preferred embodiments, one will likely desire to employ a 
fluorescent label or an enzyme tag, such as urease, alkaline phosphatase or peroxidase, instead of 
radioactive or other environmentally undesirable reagents. In the case of enzyme tags, 
colorimetric indicator substrates are known that can be employed to provide a means visible to 
the human eye or spectrophotometrically, to identify specific hybridization with complementary 
nucleic acid-containing samples. 

In general, it is envisioned that the hybridization probes described herein will be useful 
both as reagents in solution hybridization as well as in embodiments employing a solid phase. In 
embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise affixed to 
a selected matrix or surface. This fixed, single-stranded nucleic acid is then subjected to specific 
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hybridization with selected probes under desired conditions. The selected conditions will depend 
on the particular circumstances based on the particular criteria required (depending, for example, 
on the G+C content, type of target nucleic acid, source of nucleic acid, size of hybridization 
probe, etc.y Following washing of the hybridized surface so as to remove nonspecifically bound 
probe molecules, specific hybridization is detected, or even quantitated, by means of the label. 

2.3 Vectors and Methods for Recombinant Expression of Cry Polypeptides 

In other embodiments, it is contemplated that certain advantages will be gained by 
positioning the codmg DNA segment under the control of a recombinant, or heterologous, 
promoter. As used herein, a recombinant or heterologous promoter is intended to refer to a 
promoter that is not normally associated with a DNA segment encoding a crystal protein or 
peptide in its natural environment. Such promoters may include promoters normally associated 
with other genes, and/or promoters isolated fi-om any bacterial, viral, eukaryotic, or plant cell. 
Naturally, it will be important to employ a promoter that effectively directs the expression of the 
DNA segment in the cell type, organism, or even animal, chosen for expression. The use of 
promoter and cell type combinations for protein expression is generally known to those of skill 
in the art of molecular biology, for example, see Sambrook ei aL, 1989. The promoters 
employed may be constitutive, or inducible, and can be used under the appropriate conditions to 
direct high level expression of the introduced DNA segment, such as is advantageous in the 
large-scale production of recombinant proteins or peptides. Appropriate promoter systems 
contemplated for use in high-level expression include, but are not limited to, the Pichia 
expression vector system (Pharmacia LKB Biotechnology). 

In connection with expression embodiments to prepare recombinant proteins and 
peptides, it is contemplated that longer DNA segments will most often be used, with DNA 
segments encoding the entire peptide sequence being most preferred. However, it will be 
appreciated that the use of shorter DNA segments to direct the expression of crystal peptides or 
epitopic core regions, such as may be used to generate anti-crystal protein antibodies, also falls 
within the scope of the invention. DNA segments that encode peptide antigens from about 8 to 
about 50 amino acids in length, or more preferably, from about 8 to about 30 amino acids in 
length, or even more preferably, from about 8 to about 20 amino acids in length are contemplated 
to be particularly useful. Such peptide epitopes may be amino acid sequences which comprise 
contiguous amino acid sequences from SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
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NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID N0:16, SEQ ID N0:18, SEQ 
ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, 
SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID 
NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50 and SEQ ID NO: 63. 

2,4 cry Transgenes and Transgenic Plants Expressing Cry Polypeptides 

In yet another aspect, tfie present invention provides methods for producing a transgenic 
plant which expresses a nucleic acid segment encoding the novel polypeptides and endotoxins of 
the present invention. The process of producing transgenic plants is well-known in the art. In 
general, the method comprises transforming a suitable host cell with a DNA segment which 
contains a promoter operatively linked to a coding region that encodes one or more CryET31, 
CryET40, CryET43, CryET44, CryET45, CryET46, CryET47, CryET49, CryETSl, CryET52, 
CryET53, CryET54, CryET55, CryET56, CryET57, CryET59, CryET60. CryET61, CryET62. 
CryET63, CryET64, CryET66, CryET67, CryET68, CryET72, CryET73, and CryET83 
polypeptides. Such a coding region is generally operatively linked to a transcription-terminating 
region, whereby the promoter is capable of driving the transcription of the coding region in the 
cell, and hence providing the cell the ability to produce the polypeptide in vivo. Alternatively, in 
instances where it is desirable to control, regulate, or decrease the amount of a particular 
recombinant crystal protein expressed in a particular transgenic cell, the invention also provides 
for the expression of crystal protein antisense mRNA. The use of antisense mRNA as a means of 
controlling or decreasing the amount of a given protein of interest in a cell is well-known in the 
art. 

Another aspect of the invention comprises transgenic plants which express a gene or gene 
segment encoding one or more of the novel polypeptide compositions disclosed herein. As used 
herein, the term "transgenic plant" is intended to refer to a plant that has incorporated DNA 
sequences, including but not limited to genes which are perhaps not normally present, DNA 
sequences not normally transcribed into RNA or translated into a protein ("expressed"), or any 
other genes or DNA sequences which one desires to introduce into the non-transformed plant, 
such as genes which may normally be present in the non-transformed plant but which one desires 
to either genetically engineer or to have altered expression. 

It is contemplated that in some instances either the nuclear or plastidic genome, or both, 
of a transgenic plant of the present invention will have been augmented through the stable 
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introduction of one or more cryETSl cryET40, cryET43, cryET44, cryET45, cryET46, cryET47, 
cryET49, cryETSI, cryET52, cryET53, cryET54, cryET55, cryET56, cryET57, cryET59, 
cryET60, cryET61, cryET62, cryET63, cryET64, cryET66. cryET67, cryET68, cryET72, 
cryET73, and cryET83 transgenes, either native, synthetically modified, or mutated. In some 
instances, more than one transgene will be incorporated into one or more genomes of the 
transformed host plant cell. Such is the case when more than one crystal protein-encoding DNA 
segment is incorporated into the genome of such a plant. In certain situations, it may be 
desirable to have one, two, three, four, or even more B, thuringiemis crystal proteins (either 
native or recombinantly-engineered) incorporated and stably expressed in the transformed 
transgenic plant. 

A preferred gene which may be introduced includes, for example, a crystal protein- 
encoding DNA sequence from bacterial origin, and particularly one or more of those described 
herein which are obtained from Bacillus spp. Highly preferred nucleic acid sequences are those 
obtained from B. thuringiensiSy or any of those sequences which have been genetically 
engineered to decrease or increase the insecticidal activity of the crystal protein in such a 
transformed host cell. 

Means for transforming a plant cell and the preparation of a transgenic cell line are well- 
known in the art, and are discussed herein. Vectors, plasmids, cosmids, YACs (yeast artificial 
chromosomes) and DNA segments for use in transforming such cells will, of course, generally 
comprise either the operons, genes, or gene-derived sequences of the present invention, either 
native, or synthetically-derived, and particularly those encoding the disclosed crystal proteins. 
These DNA constructs can further include structures such as promoters, enhancers, polylinkers, 
or even gene sequences which have positively- or negatively-regulating activity upon the 
particular genes of interest as desired. The DNA segment or gene may encode either a native or 
modified crystal protein, which will be expressed in the resultant recombinant cells, and/or 
which will impart an improved phenotype to the regenerated plant. 

Such transgenic plants may be desirable for increasing the insecticidal resistance of a 
monocotyledonous or dicotyledonous plant, by incorporating into such a plant, a transgenic 
DNA segment encoding one or more CryET31, CryET40, CryET43, CryET44, CryET45, 
CryET46, CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, CryET55, CryET56, 
CryET57, CryET59, CryET60, CryET61, CryET62, CryET63, CryET64, CryET66, CryET67, 
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CryET68, CryET72, CryET73, and CryET83 polypeptides which are toxic to a lepidopteran 
insect. Particularly preferred plants include turf grasses, kapok, sorghum, cotton, com, soybeans, 
oats, rye, wheat, flax, tobacco, rice, tomatoes, potatoes, or other vegetables, ornamental plants, 
fruit trees, and the like. 

In a related aspect, the present invention also encompasses a seed produced by the 
transformed plant, a progeny from such seed, and a seed produced by the progeny of the original 
transgenic plant, produced in accordance with the above process. Such progeny and seeds will 
have a crystal protein-encoding transgene stably incorporatied into their genome, and such 
progeny plants will inherit the traits afforded by the introduction of a stable transgene in 
Mendelian fashion. All such transgenic plants having incorporated into their genome transgenic 
DNA segments encoding one or more CryET31, CryET40, CryET43, CryET44, CryET45, 
CryET46, CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, CryET55, CryET56, 
CryET57, CryET59, CryET60, CryET61, CryET62, CryET63, CryET64, CryET66, CryET57, 
CryET68, CryET72, CryET73, and CryET83 crystal proteins or polypeptides are aspects of this 
invention. As well-known to those of skill in the art, a progeny of a plant is understood to mean 
any offspring or any descendant from such a plant, but in this case means any offspring or any 
descendant which also contains the transgene. 

2.5 Site-Specific Mutagenesis 

Site-specific mutagenesis is a technique useful in the preparation of individual peptides, 
or biologically ftinctional equivalent proteins or peptides, through specific mutagenesis of the 
underlying DNA. The technique further provides a ready ability to prepare and test sequence 
variants, for example, incorporating one or more of the foregoing considerations, by introducing 
one or more nucleotide sequence changes into the DNA, The technique of site-specific 
mutagenesis is well known in the art, as exemplified by various publications. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector or melting apart of two strands of a double stranded vector 
which includes within its sequence a DNA sequence which encodes the desired peptide. An 
oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically. 
This primer is then annealed with the single-stranded vector, and subjected to DNA polymerizing 
enzymes such as E. coli polymerase I Klenow fhigment, in order to complete the synthesis of the 
mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original 
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non-mutated sequence and the second strand bears the desired mutation. This heteroduplex 
vector is then used to transform appropriate cells, such as E. coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement 

The preparation of sequence variants of the endotoxin-encoding nucleic acid segments 
using site-directed mutagenesis is provided as a means of producing potentially useful species 
and is not meant to be limiting as there are other ways in which sequence variants of peptides 
and the DNA sequences encoding them may be obtained. For example, recombinant vectors 
encoding the desired peptide sequence may be treated with mutagenic agents, such as 
hydroxylamine, to obtain sequence variants. 

2.6 Antibody Compositions and Methods of Making 

In particular embodiments, the inventors contemplate the use of antibodies, either 
monoclonal (mAbs) or polyclonal which bind to one or more of the polypeptides disclosed 
herein. Means for preparing and characterizing antibodies are well known in the art (See, e.g., 
Harlow and Lane, 1988; incorporated herein by reference). mAbs may be readily prepared 
through use of well-known techniques, such as those exemplified in U. S. Patent 4,196,265, 
incorporated herein by reference. 

2.7 ELISAs and Immunoprecipitation 

ELISAs may be used in conjunction with the invention. Many different protocols exist 
for performing ELISAs. These are well known to those of ordinary skill in the art. Examples of 
basic ELISA protocols may be found in any standard molecular biology laboratory manual (e.g. 
Sambrook, Fritsch, and Maniatis, eds. Molecular cloning: a laboratory manual. Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory, 1989). 

2.8 Western Blots 

The compositions of the present invention will fmd great use in inununoblot or western 
blot analysis. Methods of performing immunoblot and western blot analysis are well known to 
those of skill in the are (see Sambrook, et al, ibid). Immunologically-based detection methods 
for use in conjunction with Western blotting include enzymatically-, radiolabel-, or 
fluorescently-tagged secondary antibodies gainst the toxin moiety are considered to be of 
particular use in this regard. 
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2.9 Crystal Protein Screening and Detection Kits 

The present invention contemplates methods and kits for screening samples suspected of 
containing crystal protein polypeptides or crystal protein-related polypeptides, or cells producing 
such polypeptides. A kit may contain one or more antibodies of the present invention, and may 
also contain reagent(s) for detecting an interaction between a sample and an antibody of the 
present invention. The provided reagent(s) can be radio-, fluorescently- or enzymatically-labeled 
or even epitope or ligand tagged. The kit can contain a known radiolabeled agent capable of 
binding or interacting with a nucleic acid or antibody of the present invention. 

The reagent(s) of the kit can be provided as a liquid solution, attached to a solid support 
or as a dried powder. Preferably, when the reagent(s) are provided in a liquid solution, the liquid 
solution is an aqueous solution. Preferably, when the reagent(s) provided are attached to a solid 
support, the solid support can be chromatograph media, a test plate having a plurality of wells, or 
a microscope slide. When the reagent(s) provided are a dry powder, the powder can be 
reconstituted by the addition of a suitable solvent, that may be provided. 

In still further embodiments, the present invention concerns immunodetection methods 
and associated kits. It is proposed that the crystal proteins or peptides of the present invention 
may be employed to detect antibodies having reactivity therewith, or, alternatively, antibodies 
prepared in accordance with the present invention, may be employed to detect crystal proteins or 
crystal protein-related epitope-containing peptides. In general, these methods will include first 
obtaining a sample suspected of containing such a protein, peptide or antibody, contacting the 
sample with an antibody or peptide in accordance with the present invention, as the case may be, 
under conditions effective to allow the formation of an immunocomplex, and then detecting the 
presence of the immunocomplex. 

In general, the detection of immunocomplex formation is quite well known in the art and 
may be achieved through the application of numerous approaches. For example, the present 
invention contemplates the application of ELISA, RIA, immunoblot {e.g., dot blot), indirect 
immunofluorescence techniques and the like. One may find additional advantages through the 
use of a secondary binding ligand such as a second antibody or a biotin/avidin ligand binding 
arrangement, as is known in the art. 

For assaying purposes, it is proposed that virtually any sample suspected of comprising 
either a crystal protein or peptide or a crystal protein-related peptide or antibody sought to be 
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detected, as the case may be, may be employed. It is contemplated that such embodiments may 
have application in the titering of antigen or antibody samples, in the selection of hybridomas, 
and the like. In related embodiments, the present invention contemplates the preparation of kits 
that may be employed to detect the presence of crystal proteins or related peptides and/or 
antibodies in a sample. Samples may include cells, cell supematants, cell suspensions, cell 
extracts, enzyme fiactions, protein extracts, or other cell-free compositions suspected of 
containing crystal proteins or peptides. 

Generally speaking, kits in accordance with the present invention will include a suitable 
crystal protein, peptide or an antibody directed gainst such a protein or peptide, together vwth an 
unmunodetection reagent and a means for containing the antibody or antigen and reagent. The 
immunodetection reagent will typically comprise a label associated with tfie antibody or antigen, 
or associated with a secondary binding ligand. Exemplary ligands might include a secondary 
antibody directed against the first antibody or antigen or a biotin or avidin (or streptavidin) 
ligand having an associated label. Of course, as noted above, a number of exemplary labels are 
known in the art and all such labels may be employed in connection with the present invention. 

The container will generally include a vial into which the antibody, antigen or detection 
reagent may be placed, and preferably suitably aliquotted. The kits of the present invention will 
also typically include a means for containing the antibody, antigen, and reagent containers in 
close confinement for commercial sale. Such containers may include injection or blow-molded 
plastic containers into which the desired vials are retained. 

2.10 Epitopic Core Sequences 

The present invention is also directed to protein or peptide compositions, free from total 
cells and other peptides, which comprise a purified protein or peptide which incorporates an 
epitope that is immunologically cross-reactive with one or more anti-crystal protein antibodies. 
In particular, the invention concerns epitopic core sequences derived from Cry proteins or 
peptides. 

As used herein, the term "incorporating an epitope(s) that is immunologically cross- 
reactive with one or more anti-crystal protein antibodies" is intended to refer to a peptide or 
protein antigen which includes a primary, secondary or tertiary structure similar to an epitope 
located within a crystal protein or polypeptide. The level of similarity will generally be to such a 
degree that monoclonal or polyclonal antibodies directed against the crystal protein or 
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polypeptide will also bind to, react with, or otherwise recognize, the cross-reactive peptide or 
protein antigen. Various immunoassay methods may be employed in conjunction with such 
antibodies, such as, for example. Western blotting, ELISA, RIA, and the like, all of which are 
known to those of skill in the art. The identification of Cry immimodominant epitopes, and/or 
their functional equivalents, suitable for use in vaccines is a relatively straightforward matter 
(e.g. U- S. Patent 4,554,101; Jameson and Wolf, 1988; Wolf et al, 1988; U. S. Patent 
4,554,101). The amino acid sequence of these "epitopic core sequences" may then be readily 
incorporated into peptides, either through the application of peptide synthesis or recombinant 
technology. 

Preferred peptides for use in accordance with the present invention will generally be on 
the order of about 8 to about 20 amino acids in length, and more preferably about 8 to about 15 
amino acids in length. It is proposed that particular advantages of the present invention may be 
realized through the preparation of synthetic peptides which include modified and/or extended 
epitopic/immunogenic core sequences which result in a "universal" epitopic peptide directed to 
crystal proteins, and in particular CryET31, CryET40, CryET43, CryET44, CryET45, CryET46, 
CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, CryET55, CryET56, CryET57, 
CryET59, CryET60, CryET61, CiyET62, CryET63, CryET64, CryET66, CryET67, CryET68, 
CryET72, CryET73, CryET83 and related sequences. These epitopic core sequences are 
identified herein in particular aspects as hydrophilic regions of the particular polypeptide 
antigen. 

Computerized peptide sequence analysis programs (e.g., DNAStar<I> software, DNAStar, 
Inc., Madison, WI) may also be useful in designing synthetic peptides in accordance with the 
present disclosure. 

Syntheses of epitopic sequences, or peptides which include an antigenic epitope within 
their sequence, are readily achieved using conventional synthetic techniques such as the solid 
phase method (e.g., through the use of commercially available peptide synthesizer such as an 
Applied Biosystems Model 430A Peptide Synthesizer). 

2.11 Biological Functional Equivalents 

Modification and changes may be made in the structure of the peptides of the present 
invention and DNA segments which encode them and still obtain a functional molecule that 
encodes a protein or peptide with desirable characteristics. The following is a discussion based 
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upon changing the amino acids of a protein to create an equivalent, or even an improved, second- 
generation molecule. In particular embodiments of the invention, mutated crystal proteins are 
contemplated to be useful for increasing the insecticidal activity of the protein, and consequently 
increasing the insecticidal activity and/or expression of the recombinant transgene in a plant cell. 
The amino acid changes may be achieved by changing the codons of the DNA sequence, 
according to the codons given in Table 1. 
Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










AsparUc acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


uuc 


uuu 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


He 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 








CUU 


Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


w 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











For example, certain amino acids may be substituted for other amino acids in a protein 
structure without appreciable loss of interactive binding capacity with structures such as, for 
example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it 
is the interactive capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of 
course, its underlying DNA coding sequence, and nevertheless obtain a protein with like 
properties. It is thus contemplated by the inventors that various changes may be made in the 
peptide sequences of the disclosed compositions, or corresponding DNA sequences which 
encode said peptides v\dthout appreciable loss of their biological utility or activity. 
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In making such changes, the hydropathic index of amino acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biologic function on a 
protein is generally understood in the art (Kyte and Doolittle, 1982, incorporate herein by 
reference). It is accepted that the relative hydropathic character of the amino acid contributes to 
the secondary structure of the resultant protein, which in turn defines the interaction of the 
protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, 
antigens, and the like. 

Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982), these are: isoleucine 
(+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine 
(+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine 
(-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); 
asparagine (-3.5); lysine (-3.9); and arginine (-^.5). 

It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar biological 
activity, /.e., still obtain a biological functionally equivalent protein. In making such changes, 
the substitution of amino acids whose hydropathic indices are within ±2 is preferred, those which 
are within ±1 are particularly preferred, and those within ±0.5 are even more particularly 
preferred. 

It is also understood in the art that the substitution of like amino acids can be made 
effectively on the basis of hydrophilicity. U. S. Patent 4,554,101, incorporated herein by 
reference, states that the greatest local average hydrophilicity of a protein, as governed by the 
hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101, the following hydrophilicity values have been 
assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate 
(+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); 
proline (-0.5 ±1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (- 
1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 

It is understood that an amino acid can be substituted for another having a similar 
hydrophilicity value and still obtain a biologically equivalent, and in particular, an 
immunologically equivalent protein. In such changes, the substitution of amino acids whose 
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hydrophilicity values are within ±2 is preferred, those which are within ±1 are particularly 
preferred, and those within ±0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of the 
foregoing characteristics into consideration are well known to those of skill in the art and 
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and 
asparagine; and valine, leucine and isoleucine. 
2.12 Insecticidal Compositions and Methods of Use 

The inventors contemplate that the crystal protein compositions disclosed herein will find 
particular utility as insecticides for topical and/or systemic application to field crops, grasses, 
finits and vegetables, and ornamental plants. In a preferred embodiment, the bioinsecticide 
composition comprises an oil flowable suspension of bacterial cells which expresses a novel 
crystal protein disclosed herein. Preferably the cells are B. thuringiensis NRRL B-21921, NRRL 
B-21922, NRRL 8-21923, NRRL B-21924, NRRL B-21925, NRRL B-21926, NRRL 8-21927, 
NRRL B-21928, NRRL B-21929, NRRL B-21930, NRRL B-21931, NRRL B-21932, NRRL 
B-21933, NRRL B-21934, NRRL B-21935, NRRL B-21936, NRRL 8-21937, NRRL 8-21938, 
NRRL B-21939, NRRL 8-21940, NRRL 8-21941, NRRL B-21942, NRRL B-21943, and NRRL 
B-21944, however, any such bacterial host cell expressing the novel nucleic acid segments 
disclosed herein and producing a crystal protein is contemplated to be useful, such as 
B. thuringiensis, B. megaterium, B, subtilis, E. coli, or Pseudomonas spp. 

In another important embodiment, the bioinsecticide composition comprises a water 
dispersible granule. This granule comprises bacterial cells which expresses a novel crystal 
protein disclosed herein. Preferred bacterial cells are B. thuringiensis NRRL B-21921, NRRL 
8-21922, NRRL B-21923, NRRL B-21924, NRRL B-21925, NRRL B-21926, NRRL B-21927, 
NRRL 8-21928, NRRL B-21929, NRRL B-21930, NRRL B-21931, NRRL 8-21932, NRRL 
8-21933, NRRL 8-21934, NRRL 8-21935, NRRL B-21936, NRRL 8-21937, NRRL B-21938, 
NRRL 8-21939, NRRL 8-21940, NRRL 8-21941, NRRL 8-21942, NRRL B-21943, and NRRL 
8-21944, however, bacteria such as B. thuringiensis, B. megaterium, B. subtilis, E. coli, or 
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Pseudomoms spp. cells transformed with a DNA segment disclosed herein and expressing the 
crystal protein are also contemplated to be useful. 

In a third important embodiment, the bioinsecticide composition comprises a wettable 
powder, dust, pellet, or collodial concentrate. This powder comprises bacterial cells which 
expresses a novel crysted protein disclosed herein. Preferred bacterial cells are B. thuringiensis 
NRRL B-21921, NRRL B-21922, NRRL B-21923, NRRL B-21924, NRRL 8-21925, NRRL 
B-21926, NRRL B-21927, NRRL B-21928, NRRL B-21929, NRRL 8-21930, NRRL B-21931, 
NRRL B-21932, NRRL B-21933, NRRL B-21934, NRRL B-21935, NRRL B-21936, NRRL 
B-21937, NRRL B-21938, NRRL B-21939, NRRL B-21940, NRRL B-2194I, NRRL B-21942, 
NRRL B'21943, and NRRL B-21944 cells, however, bacteria such as B. thuringiensis, B. 
megaterium, B. subtiliSy £1 coliy or Pseudomonas spp. cells transformed with a DNA segment 
disclosed herein and expressing the crystal protein are also contemplated to be useful. Such dry 
forms of the insecticidal compositions may be formulated to dissolve immediately upon wetting, 
or alternatively, dissolve in a controlled-release, sustained-release, or other time-dependent 
manner. 

In a fourth important embodiment, the bioinsecticide composition comprises an aqueous 
suspension of bacterial cells such as those described above which express the crystal protein. 
Such aqueous suspensions may be provided as a concentrated stock solution which is diluted 
prior to application, or alternatively, as a diluted solution ready-to-apply. 

For these methods involving application of bacterial cells, the cellular host containing the 
crystal protein gene(s) may be grown in any convenient nutrient medium, where the DNA 
construct provides a selective advantage, providing for a selective medium so that substantially 
all or all of the cells retain the B. thuringiensis gene. These cells may then be harvested in 
accordance with conventional ways. Altematively, the cells can be treated prior to harvesting. 

When the insecticidal compositions comprise intact B. thuringiensis cells expressing the 
protein of interest, such bacteria may be formulated in a variety of ways. They may be employed 
as wettable powders, granules or dusts, by mixing with various diluents, inert materials, such as 
inorganic minerals (phyllosilicates, carbonates, sulfates, phosphates, and the like) or botanical 
materials (powdered corncobs, rice hulls, walnut shells, and the like). The formulations may 
include spreader-sticker adjuvants, stabilizing agents, other pesticidal additives, or surfactants. 
Liquid formulations may be aqueous-based or non-aqueous and employed as foams, suspensions. 
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emulsifiable concentrates, or the like. The ingredients may include rheological agents, 
surfactants, emulsifiers, dispersants, or polymers. 

Alternatively, the novel insecticidal polypeptides may be prepared by native or 
recombinant bacterial expression systems in vitro and isolated for subsequent field application. 
Such protein may be either in crude cell lysates, suspensions, colloids, etc., or alternatively may 
be purified, refined, buffered, and/or fiirther processed, before formulating in an active biocidal 
formulation. Likewise, under certain circumstances, it may be desirable to isolate crystals and/or 
spores from bacterial cultures expressing the crystal protein and apply solutions, suspensions, or 
coUodial preparations of such crystals and/or spores as the active bioinsecticidal composition. 

Regardless of the method of application, the amount of the active component(s) is applied 
at an insecticidally-eflfective amount, which will vary depending on such factors as, for example, 
the specific coleopteran insects to be controlled, the specific plant or crop to be treated, the 
environmental conditions, and the method, rate, and quantity of application of the insecticidally- 
active composition. 

The insecticide compositions described may be made by formulating either the bacterial 
cell, crystal and/or spore suspension, or isolated protein component with the desired 
agriculturally-acceptable carrier. The compositions may be formulated prior to administration in 
an appropriate means such as lyophilized, freeze-dried, dessicated, or in an aqueous carrier, 
medium or suitable diluent, such as saline or other buffer. The formulated compositions may be 
in the form of a dust or granular material, or a suspension in oil (vegetable or mineral), or water 
or oil/water emulsions, or as a wettable powder, or in combination with any other carrier material 
suitable for agricultural application. Suitable agricultural carriers can be solid or liquid and are 
well known in the art. The term "agriculturally-acceptable carrier" covers all adjuvants, E. coli, 
inert components, dispersants, surfactants, tackifiers, binders, etc. that are ordinarily used in 
insecticide formulation technology; these are well known to those skilled in insecticide 
formulation. The formulations may be mixed with one or more solid or liquid adjuvants and 
prepared by various means, E. coli, by homogeneously mixing, blending and/or grinding the 
insecticidal composition with suitable adjuvants using conventional formulation techniques. 

The insecticidal compositions of this invention are applied to the environment of the 
target lepidopteran insect, typically onto the foliage of the plant or crop to be protected, by 
conventional methods, preferably by spraying. The strength and duration of insecticidal 
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application will be set with regard to conditions specific to the particular pest(s), crop(s) to be 
treated and particular environmental conditions. The proportional ratio of active ingredient to 
carrier will naturally depend on the chemical nature, solubility, and stability of the insecticidal 
composition, as well as the particular formulation contemplated. 

Other application techniques, including dusting, sprinkling, soaking, soil injection, seed 
coating, seedling coating, spraying, aerating, misting, atomizing, and the like, are also feasible 
and may be required under certain circumstances such as e.g., insects that cause root or stalk 
infestation, or for application to delicate vegetation or omamental plants. These application 
procedures are also well-known to those of skill in the art. 

The insecticidal composition of the invention may be employed in the method of the 
invention singly or in combination with other compounds, including and not. limited to other 
pesticides. The method of the invention may also be used in conjunction with other treatments 
such as surfactants, detergents, polymers or time-release formulations. The insecticidal 
compositions of the present invention may be formulated for either systemic or topical use. 

The concentration of insecticidal composition which is used for environmental, systemic, 
or foliar application will vary widely depending upon the nature of the particular formulation, 
means of application, environmental conditions, and degree of biocidal activity. Typically, the 
bioinsecticidal composition will be present in the applied formulation at a concentration of at 
least about 1% by weight and may be up to and including about 99% by weight. Dry 
formulations of the polypeptide compositions may be from about 1% to about 99% or more by 
weight of the protein composition, while liquid formulations may generally comprise from about 
1% to about 99% or more of the active ingredient by weight. Formulations which comprise 
intact bacterial cells will generally contain from about 10^ to about 10^ cells/mg. 

The insecticidal formulation may be administered to a particular plant or target area in 
one or more applications as needed, with a typical field application rate per hectare ranging on 
the order of from about SO g to about 500 g of active ingredient, or of from about SOO g to about 
1000 g, or of from about 1000 g to about 5000 g or more of active ingredient. 
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5.0 Description of Illustrative Embodiments 

5.1 Some Advantages of the Invention 

The use of B. thuringiensis insecticidal crystal protein genes for inplanta production of 
insecticidal proteins, thereby conferring insect resistance on important agronomic plants, is 
rapidly gaining commercial acceptance in the United States and abroad. The need for new 
insecticidal traits does not diminish, however, with the successful deployment of a handful of cry 
genes in plants. Concerns over the potential for insect resistance development, for instance, 
makes it imperative that an arsenal of insecticidal proteins (i.e. cry genes) be assembled to 
provide the genetic material necessary for tomorrow's insecticidal traits. In addition, transgenic 
plants producing a B. thuringiensis Cry protein may still be susceptible to damage from 
secondary insect pests, thus prompting the search for additional Cry proteins with improved 
efficacy towards these pests. The B. thuringiensis crystal proteins of the present invention 
represent a diverse collection of insecticidal proteins, including several that are toxic towards a 
lepidopteran colony exhibiting resistance to certain types of Cryl proteins. Bioassays against a 
wide range of lepidopteran pests confirm that these proteins possess insecticidal activity and, 
furthermore, that these proteins vary in efficacy against this array of target insects. This 
variation in the spectrum of insects affected by the toxin proteins suggests differing modes of 
action that may be important for future insect resistance management strategies. In planta 
expression of the cry genes of the present invention can confer insect resistance to the host plant 
as has been demonstrated for other cry genes from B. thuringiensis. 

5.2 Probes and Primers 

In another aspect, DNA sequence information provided by the invention allows for the 
preparation of relatively short DNA (or RNA) sequences having the ability to specifically 
hybridize to gene sequences of the selected polynucleotides disclosed herein. In these aspects, 
nucleic acid probes of an appropriate length are prepared based on a consideration of a selected 
crystal protein gene sequence, e.g., a sequence such as that shown in SEQ ID N0:1, SEQ ID 
NO:3, SEQ ID N0:5. SEQ ID N0:7, SEQ ID N0:9, SEQ ID NO:l 1, SEQ ID NO: 13, SEQ ID 
NO:15, SEQ ID N0:17, SEQ ID NO:19, SEQ ID N0:21, SEQ ID NO:23. SEQ ID NO:25, SEQ 
ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, 
SEQ ID NO:39, SEQ ID N0:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID 



wo 01/19859 



PCT/USOO/25361 



-29- 

NO:49 and SEQ ID NO:62. The ability of such DNAs and nucleic acid probes to specifically 
hybridize to a crystal protein-encoding gene sequence lends them particular utility in a variety of 
embodiments. Most importantly, the probes may be used in a variety of assays for detecting the 
presence of complementary sequences in a given sample. 

In certain embodiments, it is advantageous to use oligonucleotide primers. The sequence 
of such primers is designed using a polynucleotide of the present invention for use in detecting, 
amplifying or mutating a defined segment of a crystal protein gene from B. thuringiensis using 
PGR™ technology. Segments of related crystal protein genes from other species may also be 
amplified by PGR™ using such primers. 

To provide certain of the advantages in accordance with the present invention, a preferred 
nucleic acid sequence employed for hybridization studies or assays includes sequences that are 
complementary to at least a 14 to 30 or so long nucleotide stretch of a crystal protein-encoding 
sequence, such as that shown m SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, 
SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO:15, SEQ ID N0:17, SEQ ID 
NO: 19, SEQ ID NO:2I, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ 
ID NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID N0:41, 
SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62. A size of 
at least about 14 or so nucleotides in length helps to ensure that the fragment will be of sufficient 
length to form a duplex molecule that is both stable and selective. Molecules having 
complementary sequences over stretches greater than about 14 or so bases in length are generally 
preferred, though, in order to increase stability and selectivity of the hybrid, and thereby improve 
the quality and degree of specific hybrid molecules obtained. One will generally prefer to design 
nucleic acid molecules having gene-complementary stretches of about 14 to about 20 or so 
nucleotides, or even longer where desired. Such fragments may be readily prepared by, for 
example, directly synthesizing the fragment by chemical means, by application of nucleic acid 
reproduction technology, such as the PGR™ technology of U. S. Patents 4,683,195, and 
4,683,202, herein incorporated by reference, or by excising selected DNA fragments from 
recombinant plasmids containing appropriate inserts and suitable restriction sites. 

5.3 Expression Vectors 

The present invention contemplates an expression vector comprising a polynucleotide of 
the present invention. Thus, in one embodiment an expression vector is an isolated and purified 
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DNA molecule comprising a promoter operatively linked to an coding region that encodes a 
polypeptide of the present invention, which coding region is operatively linked to a transcription- 
terminating region, whereby the promoter drives the transcription of the coding region. 

As used herein, the term "operatively linked" means that a promoter is connected to an 
coding region in such a way that the transcription of that coding region is controlled and 
regulated by that promoter. Means for operatively linking a promoter to a coding region are well 
known in the art. 

In a preferred embodiment, the recombinant expression of DNAs encoding the crystal 
proteins of the present invention is preferable in a Bacillus host cell. Preferred host cells include 
B. thuringiensis, B. megaterium, B. subtilis, and related bacilli, with B. thuringiensis host cells 
being highly preferred. Promoters that function in bacteria are well-known in the art. An 
exemplary and preferred promoter for the Bacillus crystal proteins include any of the known 
crystal protein gene promoters, including the cryETSJ, cryET40, cryET43, cryET44, cryET45, 
cryET46, cryET47, cryET49, cryETSl, cryET52, cryET53, cryET54, cryETSS, cryET56, 
cryET57, cryET59, cryET60, cryET6l cryET62, cryET63, cryET64, cryET66, cryET67, 
cryET68, cryET72, cryET73, and cryET83 gene promoters. Alternatively, mutagenized or 
recombinant crystal protein-encoding gene promoters may be engineered by the hand of man and 
used to promote expression of the novel gene segments disclosed herein. 

In an altemate embodiment, the recombinant expression of DNAs encoding the crystal 
proteins of the present invention is performed using a transformed Gram-negative bacterium 
such as an coli or Pseudomonas spp, host cell. Promoters which function in high-level 
expression of target polypeptides in £. coli and other Gram-negative host cells are also well- 
known in the art. 

Where an expression vector of the present invention is to be used to transform a plant, a 
promoter is selected that has the ability to drive expression in plants. Promoters that function in 
plants are also well known in the art. Useful in expressing the polypeptide in plants are 
promoters that are inducible, viral, synthetic, constitutive as described (Pos2kowski et al^ 1989; 
Odell et al, 1985), and temporally regulated, spatially regulated, and spatio-temporally regulated 
(Chaue/ fl/.,1989). 

A promoter is also selected for its ability to direct the transformed plant cell's or 
transgenic plant's transcriptional activity to the coding region. Structural genes can be driven by 
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a variety of promoters in plant tissues. Promoters can be near-constitutive, such as the CaMV 
35S promoter, or tissue-specific or developmentally specific promoters affecting dicots or 
monocots. 

Where the promoter is a near-constitutive promoter such as CaMV 35S, increases in 

5 polypeptide expression are found in a variety of transformed plant tissues (e.g., callus, leaf, seed 
and root). Alternatively, the effects of transformation can be directed to specific plant tissues by 
using plant integrating vectors containing a tissue-specific promoter. 

An exemplary tissue-specific promoter is the lectin promoter, which is specific for seed 
tissue. The Lectin protein in soybean seeds is encoded by a single gene {Lei) that is only 

10 expressed during seed maturation and accounts for about 2 to about 5% of total seed mRNA. 
The lectin gene and seed-specific promoter have been fiilly characterized and used to direct seed 
specific expression in transgenic tobacco plants (Vodkin et ai, 1983; Lindstrom ei al, 1990.) 

An expression vector containing a coding region that encodes a polypeptide of interest is 
engineered to be under control of the lectin promoter and that vector is introduced into plants 

15 using, for example, a protoplast transformation method (Dhir et al., 1991). The expression of the 
polypeptide is directed specifically to the seeds of the transgenic plant. 

A transgenic plant of the present invention produced fi-om a plant cell transformed with a 
tissue specific promoter can be crossed with a second transgenic plant developed from a plant 
cell transformed with a different tissue specific promoter to produce a hybrid transgenic plant 

20 that shows the effects of transformation in more than one specific tissue. 

Exemplary tissue-specific promoters are com sucrose synthetase 1 (Yang et al, 1990), 
com alcohol dehydrogenase 1 (Vogel ei al, 1989), com light harvesting complex (Simpson, 
1986), com heat shock protein (Odell et aL, 1985), pea small subunit RuBP carboxylase 
(Poulsen et al, 1986; Cashmore et aL, 1983), Ti plasmid mannopine synthase (Langridge et al, 

25 1989), Ti plasmid nopaline synthase (Langridge et aL, 1989), petunia chalcone isomerase (Van 
Tunen etaL, 1988), bean glycine rich protein 1 (Keller et al, 1989), CaMV 35s transcript (Odell 
et al, 1985) and Potato patatin (Wenzler et aL, 1989). Preferred promoters include a cauliflower 
mosaic virus (CaMV 35S) promoter, a S-E9 small subunit RuBP carboxylase promoter, a rice 
actin promoter, a maize histone promoter, a fused CaMV 35S'Arabidopsis histone promoter, a 

30 CaMV 35S promoter, a CaMV 19S promoter, a nos promoter, an Adh promoter, an actin 
promoter, a histone promoter, a ribulose bisphosphate carboxylase promoter, an R-allele 
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ptomoter, a root cell promoter, an a-tubulin promoter, an ABA-inducible promoter, a turgor- 
inducible promoter, a rbcS promoter, a com sucrose synthetase 1 promoter, a com alcohol 
dehydrogenase 1 promoter, a com light harvesting complex promoter, a com heat shock protein 
promoter, a pea small subunit RuBP carboxylase promoter, a Ti plasmid mannopine synthase 
promoter, a Ti plasmid nopaline synthase promoter, a petunia chalcone isomerase promoter, a 
bean glycine rich protein 1 promoter, a CaMV 35s transcript promoter, a potato patatin promoter, 
a cab promoter, a PEP-Carboxylase promoter and an S-E9 small subunit RuBP carboxylase 
promoter. 

The choice of which expression vector and ultimately to which promoter a polypeptide 
coding region is operatively linked depends directly on the functional properties desired, e.g., the 
iocation and timing of protein expression, and the host cell to be transformed. These are well 
known limitations inherent in the art of constmcting recombinant DNA molecules. However, a 
vector useful in practicing the present invention is capable of directing the expression of the 
polypeptide coding region to which it is operatively linked. 

Typical vectors useful for expression of genes in higher plants are well known in the art 
and include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens 
described (Rogers et al, 1987). However, several other plant integrating vector systems are 
known to function in plants including pCaMVCN transfer control vector described (Fromm et 
al, 1985). Plasmid pCaMVCN (available from Pharmacia, Piscataway, NJ) includes the 
cauliflower mosaic virus CaMV 35S promoter. 

In preferred embodiments, the vector used to express the polypeptide includes a selection 
marker that is effective in a plant cell, preferably a dmg resistance selection marker. One 
preferred drag resistance marker is the gene whose expression results in kanamycin resistance; 
I.e., the chimeric gene containing the nopaline synthase iwomoter, Tn5 neomycin 
phosphotransferase II (nptll) and nopaline synthase 3' non-translated region described (Rogers et 
al, 1988). 

RNA polymerase transcribes a coding DNA sequence through a site vAi&ce 
polyadenylation occurs. Typically, DNA sequences located a few hundred base pairs 
downstream of the polyadenylation site serve to terminate transcription. Those DNA sequences 
are referred to herein as transcription-termination regions. Those regions are required for 
efficient polyadenylation of transcribed messenger RNA (mRNA). 
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Means for preparing expression vectors are well known in the art. Expression 
(transformation vectors) used to transform plants and methods of making those vectors are 
described in U. S. Patents 4,971,908, 4,940,835, 4,769,061 and 4,757,011, the disclosures of 
which are incorporated herein by reference. Those vectors can be modified to include a coding 
sequence in accordance with the present invention. 

A variety of methods has been developed to operatively link DNA to vectors via 
complementary cohesive termini or blunt ends. For instance, complementary homopolymer 
tracts can be added to the DNA segment to be inserted and to the vector DNA. The vector and 
DNA segment are then joined by hydrogen bonding between the complementary homopolymeric 
tails to form recombinant DNA molecules. 

A coding region that encodes a polypeptide having the ability to confer insecticidal 
activity to a cell is preferably a CryET31, CryET40, CryET43, CryET44, CryET45, CryET46, 
CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, CryET55, CryET56, CryET57, 
CryET59, CryET60, CryET61, CryET62, CryET63, CryET64, CryET66, CryET67, CryET68, 
CryET72, CryET73, and CryET83 polypeptide-encbding gene. 

5.7 Nomenclature of the Novel Polypeptides 

The inventors have arbitrarily assigned the designation CryET31, CryET40, CTyET43, 
CryET44, CryET45, CryET46, CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, 
CryET56, CryET57, CiyET59, CryET60, CryET61, CryET62, CryET63, CryET64, CryET66, 
CryET67, CiyET68, CryET72, CryET73, and CryET83 to the polypeptides of this invention. 
Likewise, the arbitrary designations of cryETSl, cryET40, cryET43, cryET44, cryET45, 
cryET46, cryET47, cryET49, cryETSl cryET52, cryET53, cryET54, cryET56, cryET57, 
cryET59, cryET60, cryET6l cryET62, cryET63, cryET64, cryET66, cryET67, cryET68, 
cryET72, cryET73, and cryET83 have been assigned to the novel nucleic acid sequence which 
encodes these polypeptides, respectively. Formal assignment of gene and protein designations 
based on the revised nomenclature of crystal protein endotoxins will be assigned by a committee 
on the nomenclature of B. thuringiensis^ formed to systematically classify B. thuringiensis 
crystal proteins. The inventors contemplate that the arbitrarily assigned designations of the 
present invention will be superceded by the official nomenclature assigned to these sequences. 
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5.8 Transformed Host Cells and Transgenic Plants 

Methods and compositions for transforming a bacterium, a yeast cell, a plant cell, or an 
entire plant with one or more expression vectors comprising a crystal protein-encoding gene 
segment are further aspects of this disclosure. A transgenic bacterium, yeast cell, plant cell or 
plant derived from such a transformation process or the progeny and seeds ftom such a 
transgenic plant are also fiuther embodiments of the invention. 

Means for transforming bacteria and yeast cells are well known in the art. Typically, 
means of transformation are similar to those well known means used to transform other bacteria 
or yeast such as E. coli or Saccharomyces cerevisiae. Methods for DNA transformation of plant 
cells include ^gro6ocrer/wm-mediated plant transformation, protoplast transformation, gene 
transfer into pollen, injection into reproductive organs, injection into immature embryos and 
particle bombardment. Each ofthese methods has distinct advantages and disadvantages. Thus, 
one particular method of introducing genes into a particular plant strain may not necessarily be 
the most effective for another plant strain, but it is well known which metfiods are useful for a 
particular plant strain. 

There are many methods for introducing transforming DNA segments into cells, but not 
all are suitable for delivering DNA to plant cells. Suitable methods are believed to include 
virtually any method by which DNA can be introduced into a cell, such as by Agrobacterium 
infection, direct delivery of DNA such as, for example, by PEG-mediated transformation of 
protoplasts (OmiruUeh et ai, 1993), by desiccation/inhibition-mediated DNA uptake, by 
electroporation, by agitation witii silicon cart>ide fibers, by acceleration of DNA coated particles, 
etc. In certain embodiments, acceleration metiiods are preferred and include, for example, 
microprojectile bombardment and the like. 

Technology for introduction of DNA into cells is well-known to tiiose of skill in tiie art. 
Four general methods for delivering a gene into cells have been described: (1) chemical methods 
(Graham and van der Eb, 1973; Zatloukal et al, 1992); (2) physical metiiods such as 
microinjection (Capecchi, 1980), electroporation (Wong and Neumann, 1982; Fromm et al., 
1985; U. S. Patent No. 5,384,253) and tiie gene gun (Johnston and Tang, 1994; Fynan et al., 
1993); (3) viral vectors (Clapp, 1993; Lu et ai, 1993; Eglitis and Anderson, 1988a; 1988b); and 
(4) receptor-mediated mechanisms (Curiel et al., 1991; 1992; Wagner et al, 1992). 
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5.83 Agrobacterium-Mediated Transfer 

Agrobacterium-m^diaXcd transfer is a widely applicable system for introducing genes into 
plant cells because the DNA can be introduced into whole plant tissues, thereby bypassing the 
need for regeneration of an intact plant from a protoplast. The use of Agrobacterium-mediaicd 
plant integrating vectors to introduce DNA into plant cells is well known in the art. See, for 
example, the methods described (Fraley et aL, 1985; Rogers et al^ 1987). Further, the 
integration of the Ti-DNA is a relatively precise process resulting in few rearrangements. The 
region of DNA to be transferred is defined by the border sequences, and intervening DNA is 
usually inserted into the plant genome as described (Spieimann et aly 1986; Jorgensen et aly 
1987). 

Modem Agrobacterium transformation vectors are capable of replication in E. coli as 
well as Agrobacterium, allowing for convenient manipulations as described (Klee et al.j 1985). 
Moreover, recent technological advances in vectors for Agrobacterium-medisitcd gene transfer 
have improved the arrangement of genes and restriction sites in the vectors to facilitate 
construction of vectors capable of expressing various polypeptide coding genes. The vectors 
described (Rogers et al., 1987), have convenient multi*linker regions flanked by a promoter and 
a polyadenylation site for direct expression of inserted polypeptide coding genes and are suitable 
for present piirposes. In addition, Agrobacterium containing both armed and disarmed Ti genes 
can be used for the transformations. In those plant strains where Agrobacterium-mcdiated 
transformation is efficient, it is the method of choice because of the facile and defined nature of 
the gene transfer. 

Agrobacterium-mediated transformation of leaf disks and other tissues such as 
cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally infects, 
Agrobacterium-mcdiaied transformation is most efficient in dicotyledonous plants. Few 
monocots appear to be natural hosts for Agrobacterium, although transgenic plants have been 
produced in asparagus using Agrobacterium vectors as described (Bytebier et al,, 1987). 
Therefore, commercially important cereal grains such as rice, com, and wheat must usually be 
transformed using alternative methods. However, as mentioned above, the transformation of 
asparagus using Agrobacterium can also be achieved (see, for example, Bytebier et aL, 1 987). 

A transgenic plant formed using Agrobacterium transformation methods typically 
contains a single gene on one chromosome. Such transgenic plants can be referred to as being 
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heterozygous for the added gene. However, inasmuch as use of the word "heterozygous" usually 
implies the presence of a complementary gene at the same locus of the second chromosome of a 
pair of chromosomes, and there is no such gene in a plant containing one added gene as here, it is 
believed that a more accurate name for such a plant is an independent segregant, because the 
added, exogenous gene segregates independently during mitosis and meiosis. 

More preferred is a transgenic plant that is homozygous for the added structural gene; 
Le., a transgenic plant that contains two added genes, one gene at the same locus on each 
chromosome of a chromosome pair. A homozygous transgenic plant can be obtained by sexually 
mating (selfmg) an independent segregant transgenic plant that contains a single added gene, 
germinating some of the seed produced and analyzing the resulting plants produced for enhanced 
carboxylase activity relative to a control (native, non-transgenic) or an independent segregant 
transgenic plant. 

It is to be understood that two different transgenic plants can also be mated to 
produce offspring that contain two independently segregating added, exogenous genes. Selfing 
of appropriate progeny can produce plants that are homozygous for both added, exogenous genes 
that encode a polypeptide of interest. Back-crossing to a parental plant and out-crossing with a 
non-transgenic plant are also contemplated. 

Transformation of plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of 
these treatments (see, e.g., Potrykus et al, 1985; Lorz et ai, 1985; Fromni et ai, 1985; Uchimiya 
et al, 1986; Callis et al, 1987; Marcotte et al, 1988). 

Application of these systems to different plant strains depends upon the ability to 
regenerate that particular plant strain from protoplasts. Illustrative methods for the regeneration 
of cereals from protoplasts are described (Fujimura et ah, 1985; Toriyama et al, 1986; Yamada 
et al, 1986; Abdullah et al, 1986). 
5.8.4 Other Transformation Methods 

Transformation of plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of 
these treatments (see, e.g., Potrykus etal, 1985; Lorz era/., 1985;Fromm etal, 1985; Uchimiya 
et al, 1986; Callis et al, 1987; Marcotte et al, 1988). 
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lUustrative methods for the regeneration of cereals from protoplasts are described 
(Fujimurae/aA, 1985; Toriyama e/a/., 1986; Yamada era/., 1986; Abdullah era/., 1986). 

5.8.5 Gene Expression in Plants 

Although great progress has been made in recent years with respect to preparation of 
transgenic plants which express bacterial proteins such as B. thuringiensis crystal proteins, the 
results of expressing native bacterial genes in plants are often disappointing. In recent years, 
however, several potential factors have been implicated as responsible in varying degrees for the 
level of protein expression from a particular coding sequence. For example, scientists now know 
that maintaining a significant level of a particular mRNA in the cell is indeed a critical factor. 
Unfortunately, the causes for low steady state levels of mRNA encoding foreign proteins are 
many. First, full length RNA synthesis may not occur at a high frequency. This could, for 
example, be caused by the premature termination of RNA during transcription or due to 
unexpected mRNA processing during transcription. Second, full length RNA may be produced 
in the plant cell, but then processed (splicing, polyA addition) in the nucleus in a fashion that 
creates a nonfunctional mRNA. If the RNA is not properly synthesized, terminated and 
polyadenylated, it cannot move to the cytoplasm for translation. Similarly, in the cytoplasm, if 
mRNAs have reduced half lives (which are determined by their primary or secondary sequence) 
inisufficient protein product will be produced. In addition, there is an effect, whose magnitude is 
uncertain, of translational efficiency on mRNA half-life. In addition, every RNA molecule folds 
into a particular structure, or perhaps family of structures, which is determined by its sequence. 
The particular structure of any RNA might lead to greater or lesser stability in the cytoplasm. 
Structure per se is probably also a determinant of mRNA processing in the nucleus. It is likely 
that dramatically changing the sequence of an RNA will have a large effect on its folded 
structure It is likely that structure per se or particular structural features also have a role in 
determining RNA stability. 

To overcome these limitations in foreign gene expression, researchers have identified 
particular sequences and signals in RNAs that have the potential for having a specific effect on 
RNA stability. In certain embodiments of the invention, therefore, there is a desire to optimize 
expression of the disclosed nucleic acid segments in planta. One particular method of doing so, 
is by alteration of the bacterial gene to remove sequences or motifs which decrease expression in 
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a transformed plant cell. The process of engineering a coding sequence for optimal expression in 
planta is often referred to as ""plantizing"* a DNA sequence. 

Particularly problematic sequences are those which are A+T rich. Unfortunately, since 
5. thuringiensis has an A+T rich genome, native crystal protein gene sequences must often be 
modified for optimal expression in a plant. The sequence motif ATTTA (or AUUUA as it 
appears in RNA) has been implicated as a destabilizing sequence in mammalian cell mRNA 
(Shaw and Kamen, 1986). Many short lived mRNAs have A+T rich 3' untranslated regions, and 
these regions often have the ATTTA sequence, sometimes present in multiple copies or as 
multimers (e.g., ATTTATTTA...). Shaw and Kamen showed that the transfer of the 3' end of an 
unstable mRNA to a stable RNA (globin or VAl) decreased the stable RNA's half life 
dramatically. They further showed that a pentamer of ATTTA had a profound destabilizing 
effect on a stable message, and that this signal could exert its effect whether it was located at the 
y end or within the coding sequence. However, the number of ATTTA sequences and/or the 
sequence context in which they occur also appear to be important in determining whether they 
function as destabilizing sequences. Shaw and Kamen showed that a trimer of ATTTA had 
much less effect than a pentamer on mRNA stability and a dimer or a monomer had no effect on 
stability (Shaw and Kamen, 1987). Note that multimers of ATTTA such as a pentamer 
automatically create an A+T rich region. This was shown to be a cytoplasmic effect, not nuclear. 
In other unstable mRNAs, the ATTTA sequence may be present in only a single copy, but it is 
often contained in an A+T rich region. From the animal cell data collected to date, it appears 
that ATTTA at least in some contexts is important in stability, but it is not yet possible to predict 
which occurrences of ATTTA are destabiling elements or whether any of these effects are likely 
to be seen in plants. 

Some studies on mRNA degradation in animal cells also indicate that RNA degradation 
may begin in some cases with nucleolytic attack in A+T rich regions. It is not clear if these 
cleavages occur at ATTTA sequences. There are also examples of mRNAs that have differential 
stability depending on the cell type in which they are expressed or on the stage within the cell 
cycle at which they are expressed. For example, histone mRNAs are stable during DNA 
synthesis but unstable if DNA synthesis is disrupted. The 3' end of some histone mRNAs seems 
to be responsible for this effect (Pandey and Marzluff, 1987). It does not appear to be mediated 
by ATTTA, nor is it clear what controls the differential stability of this mRNA. Another 
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example is the differential stability of IgG mRNA in B lymphocytes during B cell maturation 
(Genovese and Milcarek, 1988). A final example is the instability of a mutant p-thallesemic 
globin mRNA. In bone marrow cells, where this gene is normally expressed, the mutant mRNA 
is unstable, while the wild-type mRNA is stable. When the mutant gene is expressed in HeLa or 
L cells in vitro, the mutant mRNA shows no instability (Lim et al, 1988). These examples all 
provide evidence that mRNA stability can be mediated by cell type or cell cycle specific factors. 
Furthermore this type of instability is not yet associated with specific sequences. Given these 
uncertainties, it is not possible to predict which RNAs are likely to be unstable in a given cell. In 
addition, even the ATTTA motif may act differentially depending on the nature of the cell in 
which the RNA is present. Shaw and Kamen (1987) have reported that activation of protein 
kinase C can block degradation mediated by ATTTA. 

The addition of a poly adenylate string to the 3' end is common to most eukaryotic 
mRNAs, both plant and animal. The currently accepted view of polyA addition is that the 
nascent transcript extends beyond the mature 3' terminus. Contained within this transcript are 
signals for polyadenylation and proper 3' end formation. This processing at the 3' end involves 
cleavage of the mRNA and addition of polyA to the mature 3' end. By searching for consensus 
sequences near the polyA tract in both plant and animal mRNAs, it has been possible to identify 
consensus sequences that apparently are involved in polyA addition and 3' end cleavage. The 
same consensus sequences seem to be important to both of these processes. These signals are 
typically a variation on the sequence AATAAA. In animal cells, some variants of this sequence 
that are functional have been identified; in plant cells there seems to be an extended range of 
functional sequences (Wickens and Stephenson, 1984; Dean et al, 1986). Because all of these 
consensus sequences are variations on AATAAA, they all are A+T rich sequences. This 
sequence is typically found 15 to 20 bp before the polyA tract in a mature mRNA. Studies in 
animal cells indicate that this sequence is involved in both polyA addition and 3' maturation. 
Site directed mutations in this sequence can disrupt these functions (Conway and Wickens, 1988; 
Wickens et al, 1987). However, it has also been observed that sequences up to 50 to 100 bp 3' 
to the putative polyA signal are also required; i.e., a gene that has a normal AATAAA but has 
been replaced or disrupted dovmstream does not get property polyadenylated (Gil and Proudfoot, 
1984; Sadofsky and Alwine, 1984; McDevitt et al, 1984). That is, the polyA signal itself is not 
sufficient for complete and proper processing. It is not yet known what specific downstream 
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sequences are required in addition to the polyA signal, or if there is a specific sequence that has 
this function. Therefore, sequence analysis can only identify potential polyA signals. 

In naturally occurring mRNAs that are normally polyadenylated, it has been observed 
that disruption of this process, either by altering the polyA signal or other sequences in the 
mRNA, profound effects can be obtained in the level of functional mRNA. This has been 
observed in several naturally occurring mRNAs, with results that are gene-specific so far. 

It has been shown that in natural mRNAs proper polyadenylation is important in mRNA 
accumulation, and that disruption of this process can effect mRNA levels significanUy. 
However, insufficient knowledge exists to predict the effect of changes in a normal gene. In a 
heterologous gene, it is even harder to predict the consequences. However, it is possible that the 
putative sites identified are dysfunctional. That is. these sites may not act as proper polyA sites, 
but instead function as aberrant sites that give rise to unstable mRNAs. 

In animal cell systems, AATAAA is by far the most common signal identified in mRNAs 
upstream of the polyA, but at least four variants have also been found (Wickens and Stephenson, 
1984). In plants, not nearly so much analysis has been done, but it is clear that multiple 
sequences similar to AATAAA can be used. The plant sites in Table 2 called major or minor 
refer only to the study of Dean et al. (1986) which analyzed only three types of plant gene. The 
designation of polyadenylation sites as major or minor refers only to the frequency of their 
occurrence as functional sites in naturally occurring genes that have been analyzed. In the case 
of plants this is a very limited database. It is hard to predict with any certainty that a site 
designated major or minor is more or less likely to function partially or completely when found 
in a heterologous gene such as those encoding the crystal proteins of the present invention. 
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The present invention provides a method for preparing synthetic plant genes which genes 
express their protein product at levels significantly higher than the wild-type genes which were 
commonly employed in plant transfonnation heretofore. In another aspect, the present invention 
also provides novel synthetic plant genes which encode non-plant proteins. 

As described above, the expression of native A thuringiensis genes in plants is often 
problematic. The nature of the coding sequences of B. thuringiensis genes distinguishes them 
from plant genes as well as many other heterologous genes expressed in plants. In particular, 
B, thuringiensis genes are very rich (-62%) in adenine (A) and thymme (T) while plant genes 
and most other bacterial genes which have been expressed in plants are on the order of 45-55% 
A+T. 

Due to the degeneracy of the genetic code and the limited number of codon choices for 
any amino acid, most of the "excess" A+T of the structural coding sequences of some Bacillus 
species are found in the third position of the codons. That is, genes of some Bacillus species 
have A or T as the third nucleotide in many codons. Thus A+T content in part can determine 
codon usage bias. In addition, it is clear that genes evolve for maximimi function in the 
organism in which they evolve. This means that particular nucleotide sequences foimd in a gene 
from one organism, where they may play no role except to code for a particular stretch of amino 
acids, have the potential to be recognized as gene control elements in another organism (such as 
transcriptional promoters or terminators, polyA addition sites, intron splice sites, or specific 
mRNA degradation signals). It is perhaps surprising that such misread signals are not a more 
common feature of heterologous gene expression, but this can be explained in part by the 
relatively homogeneous A+T content (-50%) of many organisms. This A+T content plus the 
nature of the genetic code put clear constraints on the likelihood of occurrence of any particular 
oligonucleotide sequence. Thus, a gene from E. coli with a 50% A+T content is much less likely 
to contain any particular A+T rich segment than a gene from B. thuringiensis. 

Typically, to obtain high-level expression of the S-endotoxin genes in plants, existing 
structural coding sequence ("structural gene") which codes for the S-endotoxin are modified by 
removal of ATTTA sequences and putative polyadenylation signals by site directed mutagenesis 
of the DNA comprising the structural gene. It is most preferred that substantially all the 
polyadenylation signals and ATTTA sequences are leinoved although enhanced expression 
levels are observed with only partial removal of either of the above identified sequences. 
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Altemately if a synthetic gene is prepared which codes for the expression of the subject protein, 
codons are selected to avoid the ATTTA sequence and putative polyadenylation signals. For 
purposes of the present invention putative polyadenylation signals include, but are not 
necessarily Ihnited to, AATAAA, AATAAT, AACCAA, ATATAA, AATCAA, ATACTA, 
ATAAAA, ATGAAA, AAGCAT, ATTAAT, ATACAT, AAAATA, ATTAAA, AATTAA, 
AATACA and CATAAA. In replacing the ATTTA sequences and polyadenylation signals, 
codons are preferably utilized which avoid the codons which are rarely found in plant genomes. 

The selected DNA sequence is scanned to identify regions with greater than four 
consecutive adenine (A) or thymine (T) nucleotides. The A+T regions are scanned for potential 
plant polyadenylation signals. Although the absence of five or more consecutive A or T 
nucleotides eliminates most plant polyadenylation signals, if there are more than one of the 
minor polyadenylation signals identified within ten nucleotides of each other, then the nucleotide 
sequence of this region is preferably altered to remove these signals while maintaining the 
original encoded amino acid sequence. 

The second step is to consider the about 15 to about 30 or so nucleotide residues 
surrounding the A+T rich region identified in step one. If the A+T content of the surrounding 
region is less than 80%, the region should be examined for polyadenylation signals. Alteration 
of the region based on polyadenylation signals is dependent upon (1) the number of 
polyadenylation signals present and (2) presence of a major plant polyadenylation signal. 

The extended region is examined for the presence of plant polyadenylation signals. The 
polyadenylation signals are removed by site-directed mutagenesis of the DNA sequence. The 
extended region is also examined for multiple copies of the ATTTA sequence which are also 
removed by mutagenesis. 

It is also preferred that regions comprising many consecutive A-*-T bases or G+C bases 
are disrupted since these regions are predicted to have a higher likelihood to form hairpin 
structure due to self-complementarity. Therefore, insertion of heterogeneous base pairs would 
reduce the likelihood of self-complementary secondary structure formation which are known to 
inhibit transcription and/or translation in some organisms. In most cases, the adverse effects may 
be minimized by using sequences which do not contain more than five consecutive A+T or G+C. 
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5.8.6 Synthetic Oligonucleotides for Mutagenesis 

When oligonucleotides are used in the mutagenesis, it is desirable to maintain the proper 
amino acid sequence and reading frame, without introducing comniion restriction sites such as 
iBg/II, HindllU Sad, Kpnl, EcoYCL, Ncol, Pstl and Sail into the modified gene. These restriction 
sites are found in poly-linker insertion sites of many cloning vectors. Of course, the introduction 
of new polyadenylation signals, ATTTA sequences or consecutive stretches of more than five 
A+T or G+C, should also be avoided. The preferred size for the oligonucleotides is about 40 to 
about 50 bases, but fragments ranging from about 18 to about 100 bases have been utilized. In 
most cases, a minimum of about 5 to about 8 base pairs of homology to the template DNA on 
both ends of the synthesized fragment are maintained to insure proper hybridization of the primer 
to the template. The oligonucleotides should avoid sequences longer than five base pairs A+T or 
G+C. Codons used in the replacement of wild-type codons should preferably avoid the TA or 
CG doublet wherever possible. Codons are selected from a plant preferred codon table (such as 
Table 3 below) so as to avoid codons which are rarely found in plant genomes, and efforts should 
be made to select codons to preferably adjust the G+C content to about 50%. 

Regions with many consecutive A+T bases or G+C bases are predicted to have a higher 
likelihood to form hairpin structures due to self-complementarity. Disruption of these regions by 
the insertion of heterogeneous base pairs is preferred and should reduce the likelihood of the 
formation of self-complementary secondary structures such as hairpins which are known in some 
organisms to inhibit transcription (transcriptional terminators) and translation (attenuators). 

Alternatively, a completely synthetic gene for a given amino acid sequence can be 
prepared, with regions of five or more consecutive A+T or G+C nucleotides being avoided. 
Codons are selected avoiding the TA and CG doublets in codons whenever possible. Codon 
usage can be normalized against a plant preferred codon usage table (such as Table 3) and the 
G+C content preferably adjusted to about 50%. The resulting sequence should be examined to 
ensure that there are minimal putative plant polyadenylation signals and ATTTA sequences. 
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Table 3 - Preferred Codon Usage in Plants 
Amino Codon Percent Usage in Plants Amino Codon Percent Usage in Plants 
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Restriction sites found in commonly used cloning vectors are also preferably avoided. 
However, placement of several unique restriction sites throughout the gene is useful for analysis 
of gene expression or construction of gene variants. 

5 5.8.7 "Plantized" Gene Constructs 

The expression of a plant gene which exists in double-stranded DNA form involves 
transcription of messenger RNA (mRNA) from one strand of the DNA by RNA polymerase 
enzyme, and the subsequent processing of the mRNA primary transcript inside the nucleus. This 
processing involves a 3' non-translated region which adds polyadenylate nucleotides to the 3' 

10 end of the RNA. Transcription of DNA into mRNA is regulated by a region of DNA usually 
referred to as the "promoter." The promoter region contains a sequence of bases that signals 
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RNA polymerase to associate with the DNA and to initiate the transcription of mRNA using one 
of the DNA strands as a template to make a corresponding strand of RNA. 

A number of promoters which are active in plant cells have been described in the 
literature. These include the nopaline synthase (NOS) and octopine synthase (OCS) promoters 
5 (which are carried on tumor-inducing plasmids of A, tumefaciens), the Cauliflower Mosaic Virus 
(CaMV) 19S and 35S promoters, the light-inducible promoter from the small subunit of ribulose 
bis-phosphate carboxylase (ssRUBISCO, a very abundant plant polypeptide) and the mannopine 
synthase (MAS) promoter (Velten et al, 1984; Velten and Schell, 1985). All of these promoters 
have been used to create various types of DNA constructs which have been expressed in plants 

10 (see e.g., Intl. Pat. Appl. Publ. Sen No. WO 84/02913). 

Promoters which are known or are found to cause transcription of RNA in plant cells can 
be used in the present invention. Such promoters may be obtained from plants or plant viruses 
and include, but are not limited to, the CaMV35S promoter and promoters isolated from plant 
genes such as ssRUBlSCO genes. As described below, it is preferred that the particular 

15 promoter selected should be capable of causing sufficient expression to result in the production 
of an effective amount of protein. 

The promoters used in the DNA constructs (i.e. chimeric plant genes) of the present 
invention may be modified, if desired, to affect their control characteristics. For example, the 
CaMV35S promoter may be ligated to the portion of the ssRUBISCO gene that represses the 

20 expression of ssRUBISCO in the absence of light, to create a promoter which is active in leaves 
but not in roots. The resulting chimeric promoter may be used as described herein. For purposes 
of this description, the phrase "CaMV35S" promoter thus includes variations of CaMV35S 
promoter, e.g., promoters derived by means of ligation with operator regions, random or 
controlled mutagenesis, etc. Furthermore, the promoters may be altered to contain multiple 

25 "enhancer sequences" to assist in elevating gene expression. 

The RNA produced by a DNA construct of the present invention also contains a 5' non- 
translated leader sequence. This sequence can be derived from the promoter selected to express 
the gene, and can be specifically modified so as to increase translation of the mRNA. The 5' 
non-translated regions can also be obtained from viral RNA's, from suitable eukaryotic genes, or 

30 from a synthetic gene sequence. The present invention is not limited to constructs, as presented 
in the following examples. Rather, the non-translated leader sequence can be part of the 5' end 
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of the non-translated region of the coding sequence for the virus coat protein, or part of the 
promoter sequence, or can be derived from an unrelated promoter or coding sequence. In any 
case, it is preferred that the sequence flanking the initiation site conform to the translational 
consensus sequence rules for enhanced translation initiation reported by Kozak (1984). 

The cry DNA constructs of the present invention may also contain one or more modified 
or fully-synthetic structural coding sequences which have been changed to enhance the 
performance of the cry gene in plants. The structural genes of the present invention may 
optionally encode a fusion protein comprising an amino-terminal chloroplast transit peptide or 
secretory signal sequence. 

The DNA construct also contains a 3' non-translated region. The 3' non-translated region 
contains a polyadenylation signal which functions in plants to cause the addition of 
polyadenylate nucleotides to the 3' end of the viral RNA. Examples of suitable 3' regions are (1) 
the 3' transcribed, non-translated regions containing the polyadenylation signal of Agrobacterium 
tumor-inducing (Ti) plasmid genes, such as the nopaline synthase (NOS) gene, and (2) plant 
genes like the soybean storage protein (7S) genes and the small subunit of the RuBP carboxylase 
(E9) gene. 

5.9 Methods for Producing Insect-Resistant Transgenic Plants 

By transforming a suitable host cell, such as a plant cell, with a recombinant cryET3J, 
cryET40, cryET43. cryET44, cryET45, cryET46, cryET47, cryET49, cryETSl cryET52, 
cryET53, cryET54, cryET56, cryET57, cryET59. cryET60, cryET6l cryET62, cryET63, 
cryET64, cryET66, cryET67, cryET68, cryET72, cryET73, and cryET83 gene-containing 
segment, the expression of the encoded crystal protein (/.e., a bacterial crystal protein or 
polypeptide having insecticidal activity against coleopterans) can result in the formation of 
insect-resistant plants. 

By way of example, one may utilize an expression vector containing a coding region for a 
B, thuringiensis crystal protein and an appropriate selectable marker to transform a suspension of 
embryonic plant cells, such as wheat or com cells using a method such as particle bombardment 
(Maddock et al, 1991; Vasil et al, 1992) to deliver the DNA coated on microprojectiles into the 
recipient cells. Transgenic plants are then regenerated from transformed embryonic calli that 
express the insecticidal proteins. 
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The formation of transgenic plants may also be accomplished using other methods of cell 
transformation vMch are known in the art such as Agrobacterium-mediaicd DNA transfer 
(Fraley et al, 1983). Alternatively, DNA can be introduced into plants by direct DNA transfer 
into pollen (Zhou et al, 1983; Hess, 1987; Luo et al, 1988), by injection of the DNA into 
reproductive organs of a plant (Pena et al, 1987), or by direct injection of DNA into the cells of 
immature embryos followed by the rehydration of desiccated embryos (Neuhaus et aiy 1987; 
Benbrook etal^ 1986). 

The regeneration, development, and cultivation of plants from single plant protoplast 
transformants or from various transformed explants is well known in the art (Weissbach and 
Weissbach, 1988). This regeneration and growth process typically includes the steps of selection 
of transformed cells, culturing those individualized cells through the usual stages of embryonic 
development through the rooted plantlet stage. Transgenic embryos and seeds are similarly 
regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant 
growth medium such as soil. 

The development or regeneration of plants containing the foreign, exogenous gene that 
encodes a polypeptide of interest introduced by Agrobacterium from leaf explants can be 
achieved by methods well known in the art such as described (Horsch a/., 1985). In this 
procedure, transformants are cultured in the presence of a selection agent and in a medium that 
induces the regeneration of shoots in the plant strain being transformed as described (Fraley et 
al, 1983). 

This pitjcedure typically produces shoots within two to four months and those shoots are 
then transferred to an appropriate root-inducing medium containing the selective agent and an 
antibiotic to prevent bacterial growth. Shoots that rooted in the presence of the selective agent to 
form plantlets are then transplanted to soil or other media to allow the production of roots. These 
procedures vary depending upon the particular plant strain employed, such variations being well 
known in the art. 

Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic 
plants, as discussed before. Otherwise, pollen obtained from the regenerated plants is crossed to 
seed-grown plants of agronomically important, preferably inbred lines. Conversely, pollen from 
plants of those important lines is used to pollinate regenerated plants. A transgenic plant of the 
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present invention containing a desired polypeptide is cultivated using methods well known to 
one skilled in the art. 

A transgenic plant of this invention thiis has an increased amount of a coding region {eg., 
a cryETSl. cryET40, cryET43. cryET44, cryET45. cryET46. cryET47. cryET49, cryETSl, 
cryET52. cryET53, cryET54, cryET56. cryETSl, cryET59, cryET60, cryET6I. cryET62, 
cryET63. cryET64, cryET66, cryET67, cryET68, cryET72. cryET73, and cryET83 gene) that 
encodes one or more CryET31, CryET40. CryET43, CryET44, CryET45, CiyET46. CryET47, 
CryET49, CryETSl, CryET52, CryET53. CryET54, CryET56, CryET57, CryET59, CryET60. 
CryET61. CiyET62, CryET63, CryET64. CryET66, CryET67, CryET68, CryET72, CryET73, 
and CryET83 polypeptides. A preferred transgenic plant is an independent segregant and can 
transmit that gene and its activity to its progeny. A more preferred transgenic plant is 
homozygous for that gene, and transmits that gene to all of its offspring on sexual mating. Seed 
from a transgenic plant may be grown in the field or greenhouse, and resulting sexually mature 
transgenic plants are self-pollinated to generate true breeding plants. The progeny from these 
plants become true breeding lines that are evaluated for, by way of example, increased 
insecticidal capacity against coleopteran insects, preferably in the field, under a range of 
environmental conditions. The inventors contemplate that tiie present invention will find 
particular utility in the creation of transgenic plants of commercial interest including various turf 
grasses, wheat, com, rice, barley, oats, a variety of ornamental plants and vegetables, as well as a 
number of nut- and fruit-bearing trees and plants. 

5.10 Definitions 

The following words and phrases have the meanings set forth below. 

Expression: The combination of intracellular processes, including transcription and 
translation undergone by a coding DNA molecule such as a structural gene to produce a 
polypeptide. 

Identity or percent identity: refers to the degree of similarity between two nucleic acid 
or protein sequences. An alignment of the two sequences is performed by a suitable computer 
program. A widely used and accepted computer program for performing sequence alignments is 
CLUSTALW vl.6 (Thompson, et al. Nucl. Acids Res., 22: 4673-4680, 1994). The number of 
matching bases or amino acids is divided by the total number of bases or amino acids, and 
multiplied by 100 to obtain a percent identity. For example, if two 580 base pair sequences had 
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145 matched bases, they would be 25 percent identic^. If the two compared sequences are of 
different lengths, the number of matches is divided by the shorter of the two lengths. For 
example, if there were 100 matched amino acids between 200 and a 400 amino acid proteins, 
they are 50 percent identical with respect to the shorter sequence. If the shorter sequence is less 
5 than 150 bases or 50 amino acids in length, the number of matches are divided by 150 (for 
nucleic acid bases) or 50 (for amino acids), and multiplied by 100 to obtain a percent identity. 

Promoter: A recognition site on a DNA sequence or group of DNA sequences that 
provide an expression control element for a structural gene and to which RNA polymerase 
specifically binds and initiates RNA synthesis (transcription) of that gene. 
10 RegeneratlonzThe process of growing a plant from a plant cell (e.g., plant protoplast or 

explant). 

Structural gene: A polynucleotide sequence that encodes a polypeptide, that is 
expressed to produce a polypeptide, or which is cryptic or incapable of expression in its natural 
host cell but which can be isolated and purified and operably linked to at least a promoter 

15 functional in one or more host cell types to express the encoded polypeptide. 

Transformation: A process of introducing an exogenous DNA sequence (e.g., a 
vector, a recombinant DNA molecule) into a cell or protoplast in which that exogenous DNA is 
incorporated into a chromosome or is capable of autonomous replication. 

Transformed cell: A cell whose DNA has been altered by the introduction of an 

20 exogenous DNA molecule into that cell. 

Transgenic cell: Any cell derived or regenerated from a transformed cell or derived 
from a transgenic cell. Exemplary transgenic cells include plant calli derived from a transformed 
plant cell and particular cells such as leaf, root, stem, e.g., somatic cells, or reproductive (germ) 
cells obtained from a transgenic plant. 

25 Transgenic plant: A plant or progeny thereof derived from a transformed plant cell or 

protoplast, wherein the plant DNA contains an introduced exogenous DNA molecule not 
originally present in a native, non-transgenic plant of the same strain. The terms "transgenic 
plant" and "transformed plant" have sometimes been used in the art as synonymous terms to 
define a plant whose DNA contains an exogenous DNA molecule. However, it is thought more 

30 scientifically correct to refer to a regenerated plant or callus obtained from a transformed plant 
cell or protoplast as being a transgenic plant, and that usage will be followed herein. 
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Vector: A DNA molecule capable of replication in a host cell and/or to which 
another DNA segment can be operatively linked so as to bring about replication of the attached 
segment. A plasmid is an exemplary vector. 
5.11 Isolating Homologotts Gene and Gene Fragments 

The genes and 5-endotoxins according to the subject invention include not only the full 
length sequences disclosed herein but also Augments of these sequences, or fiision proteins, 
which retain the characteristic insecticidal activity of the sequences specifically exemplified 
herein. 

It should be apparent to a person skill in this art that insecticidal 5-endotoxins can be 
identified and obtained through several means. The specific genes, or portions thereof, may be 
obtained from a culture depository, or constructed synthetically, for example, by use of a gene 
machine. Variations of these genes may be readily constructed using standard techniques for 
making point mutations. Also, fragments of these genes can be made using commercially 
available exonucleases or endonucleases according to standard procedures. For example, 
enzymes such as Sa/31 or site-directed mutagenesis can be used to systematically cut off 
nucleotides from the ends of these genes. Also, genes which code for active firagments may be 
obtained using a variety of other restriction enzymes. Proteases may be used to direcUy obtain 
active fragments of these 5-endotoxins. 

Equivalent 5-endotoxins and/or genes encoding these equivalent 8-endotoxins can also be 
isolated from Bacillus strains and/or DNA libraries using the teachings provided herein. For 
example, antibodies to the 5-endotoxins disclosed and claimed herein can be used to identify and 
isolate other 5-endotoxins from a mixture of proteins. Specifically, antibodies may be raised to 
the portions of the 6-endotoxins which are most constant and most distinct from other B. 
thuringiensis 5-endotoxins. These antibodies can then be used to specifically identify equivalent 
6-endotoxins with the characteristic insecticidal activity by immunoprecipitation, enzyme linked 
immunoassay (ELISA), or Western blotting. 

A further method for identifying the S-endotoxins and genes of the subject invention is 
through the use of oligonucleotide probes. These probes are nucleotide sequences having a 
detectable label. As is well known in the art, if the probe molecule and nucleic acid sample 
hybridize by forming a strong bond between the two molecules, it can be reasonably assumed 
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that the probe and sample are essentially identical. The probe's detectable label provides a 
means for determining in a known manner whether hybridization has occurred. Such a probe 
analysis provides a rapid method for identifying formicidai §-endotoxin genes of the subject 
invention. 

The nucleotide segments which are used as probes according to the invention can be 
synthesized by use of DNA synthesizers using standard procedures. In the use of the nucleotide 
segments as probes, the particular probe is labeled with any suitable label known to those skilled 
in the art, including radioactive and non-radioactive labels. Typical radioactive labels include 
^^P, ^^^I, ^^S, or the like. A probe labeled with a radioactive isotope can be constructed from a 
nucleotide sequence complementary to the DNA sample by a conventional nick translation 
reaction, using a DNase and DNA polymerase. The probe and sample can then be combined in a 
hybridization buffer solution and held at an appropriate temperature until annealing occurs. 
Thereafter, the membrane is washed free of extraneous materials, leaving the sample and bound 
probe molecules typically detected and quantified by autoradiography and/or liquid scintillation 
counting. 

Non-radioactive labels include, for example, ligands such as biotin or thyroxine, as well 
as enzymes such as hydrolases or peroxidases, or the various chemiluminescers such as luciferin, 
or fluorescent compounds like fluorescein and its derivatives. The probe may also be labeled at 
both ends with different types of labels for ease of separation, as, for example, by using an 
isotopic label at the end mentioned above and a biotin label at the other end. 

Duplex formation and stability depend on substantial complementarity between the two 
strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. 
Therefore, the probes of the subject invention include mutations (both single and multiple), 
deletions, insertions of the described sequences, and combinations thereof, wherein said 
mutations, insertions and deletions permit formation of stable hybrids with the target 
polynucleotide of interest. Mutations, insertions, and deletions can be produced in a given 
polynucleotide sequence in many ways, by methods currently known to an ordinarily skilled 
artisan, and perhaps by other methods which may become known in the future. 

The potential variations in the probes listed is due, in part, to the redundancy of the 
genetic code. Because of the redundancy of the genetic code, Le,, more than one coding 
nucleotide triplet (codon) can be used for most of the amino acids used to make proteins. 
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Therefore diffeient nucleotide sequences can code for a particular amino acid. Thus, the amino 
acid sequences of the B. thuringiensis 5-endotoxins and peptides can be prepared by equivalent 
nucleotide sequences encoding the same amino acid sequence of the protein or peptide. 
Accordingly, the subject invention includes such equivalent nucleotide sequences. Also, inverse 

5 or complement sequences are an aspect of the subject invention and can be readily used by a 
person skilled in this art. In addition it has been shown that proteins of identified structure and 
function may be constructed by changing the amino acid sequence if such changes do not alter 
the protein secondary structure (Kaiser and Kezdy, 1984). Thus, the subject invention includes 
mutants of the amino acid sequence depicted herein which do not alter the protein secondary 

10 structure, or if the structure is altered, the biological activity is substantially retained. Further, 
the invention also includes mutants of orgzinisms hosting all or part of a 6-endotoxin encoding a 
gene of the invention. Such mutants can be made by techniques well known to persons skilled in 
the art. For example, UV irradiation can be used to prepare mutants of host organisms. 
Likewise, such mutants may include asporogenous host cells which also can be prepared by 

15 procedures well known in the art. 

6.0 Examples 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
20 practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

6.1 Example 1 - Identification of J?, thuringiensis Strains Containing Novel 5 
2s -Endotoxins 

Wild-type B. thuringiensis strains containing novel insecticidal protein genes were 
identified by Southern blot hybridization studies employing specific DNA probes. Twenty-four 
unique cry genes were discovered that are related to B, thuringiensis genes in the cryl, cry2, or 
cry9 classes of toxin genes. 
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Various methods were employed to clone the novel genes and express them in a crystal 
protein-negative (Cry-) strain of S. thuringiensis. These methods include PGR™ amplification of 
the region of cryZ-related genes that encodes the active portion of the toxin gene. The PGR™ 
product is then joined to a fragment from the cry 1 Ac gene encoding the G-terminal region of the 
protoxin. This gene fusion was then expressed in a B. thuringiensis recombinant strain to 
produce a hybrid protoxin. In this instance, it is recognized that the sequence of the amplified 
DNA can be used to design hybridization probes to isolate the entire coding sequence of the 
novel cry gene from the wild-type B. thuringiensis strain. 

Wild-type B. thuringiensis strains were screened in a bioassay to identify strains that are 
toxic to larvae of lepidopteran insects (procedure described in Example 10). Active strains were 
then examined genetically to determine if they contain novel toxin genes. The method used to 
make this determination is described below and includes isolation of genomic DNA from the 
B. thuringiensis strain, restriction enzyme digestion. Southern blot hybridization, and analysis of 
the hybridizing restriction fragments to determine which genes are present in a strain. 

Total genomic DNA was extracted by the following procedure. Vegetative cells were 
resuspended in a lysis buffer containing 50 mM glucose, 25 mM Tris-HCl (pH 8.0), 10 mM 
EDTA, and 4 mg/ml lysozyme. The suspension was incubated at 37**C for 1 h. Following 
incubation, the suspension was extracted once with an equal volume of phenol, then once with an 
equal volume of phenol:chloroform:isoamyl alcohol (50:48:2), and once with an equal volume of 
chloroform:isoamyl alcohol (24:1). The DNA was precipitated from the aqueous phase by the 
addition of one-tenth volume 3 M sodium acetate and two volumes of 100% ethanoL The 
precipitated DNA was collected by centrifiigation, washed with 70% ethanol and resuspended in 
distilled water. 

The DNA samples were digested with the restriction enzymes Clal and Pstl. The 
combination of these two enzymes give a digestion pattern of fragments that, when hybridized 
with the probe wd207 (described below), allows the identification of many of the known cryl- 
related toxin genes. Hybridizing fragments that did not correspond to the fragment sizes 
expected for the known genes were classified as unknown and were candidates for cloning and 
characterization. 

The digested DNA was size fractionated by electrophoresis through a 1 .0% agarose gel in 
IX TBE (0.089 M Tris-borate, 0.089 M boric acid, 0.002 M EDTA) overnight at 2 V/cm of gel 
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length. The fractionated DNA fragments were then transferred to a Millipore Immobilon-NC® 
nitrocellulose filter (Millipore Corp., Bedford, MA) according to the method of Southern (1975). 
The DNA fragments were fixed to the nitocellulose by baking the filter at SO^^C in a vacuum 
oven. 

To identify the DNA firagment(s) containing the sequences related to cryl genes, the 
oligonucleotide wd207 was radioactively labeled at the 5' end and used as a hybridization probe. 
To radioactively label the probe, 1-5 pmoles of wd207 were added to a reaction (20 ul total 
volume) containing 3 ul [y-'^PJATP (3,000 Ci/mmole at 10 mCi/ml), 70 mM Tris-HCl, pH 7.8, 
10 mM MgCb, 5 mM DTT, and 10 imits T4 polynucleotide kinase (Promega Corp,, Madison, 
WI). The reaction was incubated for 20 min at 37**C to allow the transfer of the radioactive 
phosphate to the 5'-end of the oligonucleotide, thus making it usefiil as a hybridization probe. 

The oligonucleotide probe used in this analysis, designated wd207, has the following 
sequence: 

5'-TGGATACTTGATCAATATGATAATCCGTCACATCTGTTnTA-3' (SEQ IDNO:51) 
This oligonucleotide was designed to specifically hybridize to a conserved region of cryl 
genes downstream from the proteolyic activation site in the protoxin. Table 4 lists some of the 
B, thuringiemis toxin genes and their identities with wd207. The orientation of the wd207 
sequence is inverted and reversed relative to the coding sequences of the cry genes. 

Table 4 



cry Gene 


% Identity to wd207 


Nucleotide Position in CDS 


crylAa 


100% 


1903-1944 


crylBa 


95.2% 


1991-2032 


cryICa 


97.6% 


1930-1971 


crylDa 


97.6% 


1858-1899 


cryJEa 


97.6% 


1885-1926 



The labeled probe was then incubated with the nitrocellulose filter overnight at 45''C in 
3X SSC (IX SSC = 0.15 M NaCl, 0.015 M sodium citrate), 0.1% SDS, lOX Denhardfs reagent 
(0.2% BSA, 0.2% polyvinylpyrrolidone, 0.2% FicoU), and 0.2 mg/ml heparin. Following this 
incubation period, the filter was washed in several changes of 3X SSC, 0.1% SDS at 45°C. The 
filter was blotted dry and exposed to Kodak X-OMAT AR X-ray film (Eastman Kodak Co., 
Rochester, NY) overnight at -TO'^C with an intensifying screen to obtain an autoradiogram. 
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The autoradiograms were analyzed to determine which wild-type B. thuringiensis strains 
contained cryl genes that could be novel. Since the probe was only 42 nucleotides, it is unlikely 
that recognition sites for the restriction endonucleases C/al and Pstl would occur within the 
hybridizing region of the cryi -related genes. Therefore, it was assumed that each hybridizing 
restriction fragment represented one cry7-related gene. The sizes, in kilobases (kb), of the 
hybridizing restriction fragments were determined based on the migration of the fragment in the 
agarose gel relative to DNA fragments of known size. The size of a fragment could be used to 
determine if that fragment represented a known cryJ gene. For example, fiom the DNA 
sequence of the cryl Ac gene it was known that wd207 would hybridize to a 0.43 kb fragment 
after digestion of cryJAc DNA with Clal and PstL If the Southern blot analysis of a strain 
showed a 0.43 kb hybrizing fragment, that strain was assigned a probable genotype of cryl Ac. 
Fragments that could not be easily assigned a probable genotype were selected as candidates for 
further analysis. Because many cry/ -containing strains have more than one cryi -related gene, 
all fragments were given a putative designation. 
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6.2 EXAMPLE 2 - IDENTIFICATION OF B. THURINGIENSJS STRAINS 
CONTAINING NOVEL CRKZ-RELATED GENES 

Proteins encoded by the cry2 class of thuringiensis class of toxin genes have activity 
on the larvae of lepidopteran and diopteran insects. Southern blot hybridization analysis of DNA 
extracted from lepidopteran-active strains was utilized to identify novel cry2-related genes. 
Total genomic DNA was isolated as described in Section 6.1. The DNA was digested with the 
restriction endonuclease Sau3A and run on a 1 .2% agarose gel as described. The digested DNA 
was transferred to nitrocellulose filters to be probed with a DNA fragment containing the crylAa 
gene. Hybridizations were performed at 55°C and the filters washed and exposed to X-ray film 
to obtain an autoradiogram. 

Sau3A digestion followed by hybridization with the crylAa gene gave characteristic 
patterns of hybridizing fragments allowing the identification of the cry2Aa, cry2Ab, and crylAc 
genes. Hybridizing fragments that differed from these patterns indicated the presence of a novel 
cry2-related gene in that strain. 

Once a strain was identified as containing one or more novel cry2-related genes, an 
additional Southern blot hybridization was performed. The procedures were the same as those 
already described above, except another restriction enzyme, usually ///wdlll, was used. Since an 
enzyme like Hin6\W (a "six base cutter") cuts DNA less frequently than does SauiK or AZ&ol, it 
was more likely to generate a restriction fragment containing the entire cry2-related gene which 
could then be readily cloned. 

63 Example 3 — Identification of B. thuringiensis Strains Containing Novel crv9-Type 
Genes 

A c/yP-specific oligonucleotide, designated pr56, was designed to facilitate the 
identification of strains harboring cryP-type genes. This oligonucleotide corresponds to 
nucleotides 4349-4416 of the gene (GenBank Accession No. Z37527). The sequence of pr56 
was as follows: 

5'-AGTAACGGTGTTACTATTAGCGAGGGCGGTCCATTCTTTAA 
AGGTCGTGCACTTCAGTTAGC-3' (SEQ ID NO:52). 

B, thuringiensis isolates were spotted or "patched" on SGNB plates, with no more than 50 
isolates per plate, and grown overnight at 25^C. The B. thuringiensis colonies were transferred 
to nitrocellulose filters and the filters placed, colony side up, on fresh SGNB plates for overnight 
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growth at SO'^C. Subsequently, the filters were placed, colony side up, on Whatman paper 
soaked in denaturing solution (1.5 M NaCl, 0.5 N NaOH) for 20 min. After denaturation, the 
filters were placed on Whatman paper soaked in neutralizing solution (3 M NaCl, 1.5 M Tris- 
HCl, pH 7.0) for 20 min. Finally, the filters were washed in 3X SSC (IX SSC = 0.15 M NaCl 
and 0.015 M sodiimi citrate) to remove cellular debris and baked in a vacuum oven at 80^C for 
90 min. 

The cryP-specific oligonucleotide pr56 (-10 pmoles) was end-labeled with [y-^^P]ATP 
using T4 polynucleotide kinase. The labeling reaction was carried out at 37°C for 20 min and 
terminated by incubating the reaction at 100 C for 3 min. After ethanol precipitation, the labeled 
oligonucleotide was resuspended in 100 ^1 distilled H2O. 

The filters were incubated with the cryP-specific probe in 6X SSC, lOX Denhardt's 
solution, 0.5% glycine, 0.2% SDS at 47°C overnight. The filters were washed twice in 3X SSC, 
0.1% SDS for 15 min at 47^C and twice in IX SSC, 0.1% SDS for 15 mm at 4TC. The dried 
filters were exposed to X-OMAT XAR-5 film (Eastman Kodak Co.) at -70*'C using an 
intensifying screen. The developed autoradiogram revealed 24 isolates of B, thuringiensis 
containing DNA that hybridized to the cry9 probe. 

To identify cryPC-type genes among these strains, two opposing oligonucleotide primers 
specific for the cry9C gene (GenBank Accession No. Z37527) were designed for polymerase 
chain reaction (PGR™) analyses. The sequence of pr58 is: 

5'-CGACTTCTCCTGCTAATGGAGG-3' (SEQ ID NO:53). 
The sequence of pr59 is: 

5'-CTCGCTAATAGTAACACCGTTACTTGCC-3' (SEQ ID NO:54). 
Plasmid DNAs were isolated from the isolates of B. thuringiensis believed to contain cryP-type 
genes. B. thuringiensis isolates were grown overnight at 3Q^C on Luna agar plates and 2 
loopfuls of cells from each isolate were suspended in 50 mM glucose, 10 mM Tris-HCl, 1 mM 
EDTA (IX GTE) containing 4 mg/ml lysozyme. After a 10 min incubation at room temperature, 
plasmid DNAs were extracted using a standard alkaline lysis procedure (Maniatis et al, 1982). 
The plasmid DNAs were resuspended in 20 ^1 of IX TE (10 mM Tris-HCl, 1 mM EDTA, pH 
7.5). Two microliters of the plasmid DNA preparations were used in the PCR™ reactions. 
Amplifications were performed in 100 ^l volumes with a Perkin-Elmer DNA Thermocycler 
(Perkin-Elmer Cetus, Foster City, CA) using materials and methods provided in the Perkin- 
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Elmer GeneAmp™ kit. Conditions for the PCR™ were as follows: 95^C for 30 sec, 46*^0 for 30 
sec, 70°C for 1 min; 30 cycles. A PCR™ using these primers and the cry9C gene as a template 
should yield a DNA fragment of --970 bp. Of twenty-four strains found to hybridize to the cry9 
probe (SEQ ID NO:XX), only one strain, EG9290, yielded the predicted amplified DNA 
fragment. 

6.4 EXAMPLE 4 - CLONING OF B. THURINGIENSIS TOXIN GENES BY 
CONSTRUCTING MBOl PARTIAL DIGEST LIBRARIES 

The restriction endonuclease Mbol was utilized in the construction of genomic DNA 
libraries because it has a recognition sequence of four base pairs which occurs frequently in long 
stretches of DNA. Total genomic DNA was isolated from B, thuringiensis strains as described in 
Section 6.1. The DNA was digested under conditions allowing limited cleavage of a DNA 
strand. The method of establishing these conditions has been described (Maniatis et al, 1982). 
Digestion of DNA in this manner created a set of essentially randomly cleaved, overiapping 
fragments which were used to create a library representative of the entire genome. 

The digested DNA fragments were separated, according to size, by agarose gel 
electrophoresis through a 0.6% agarose, IX TBE gel, overnight at 2 volts/cm of gel length. The 
gel was stained with ethidium bromide so that the digested DNA could be visualized when 
exposed to long-wave UV light. A razor blade was used to excise a gel slice containing DNA 
fragments of approximately 9- kb to 12-kb in size. The DNA fragments were removed from the 
agarose by placing the slice in a dialysis bag with enough TE (10 mM Tris-HCl, 1 mM EDTA) to 
cover the slice. The bag was then closed and placed in a horizontal electrophoresis apparatus 
filled with IX TBE buffer. The DNA was electroeluted from the slice into the TE at 100 volts 
for 2 h. The TE was removed from the bag, extracted with phenol xhloroform (1:1), followed by 
extraction with chloroform. The DNA fragments are then collected by the standard technique of 
ethanol precipitation (see Maniatis et al„ 1982). 

To create a library in E. coli of the partially-digested DNA, the fragments were ligated 
into the shuttle vector, pHT315 (Arantes and Lereclus, 1991). This plasmid contains replication 
origins for E, coli and B, thuringiensis, genes for resistance to the antibiotics erythromycin and 
ampicillin, and a muhiple cloning site. The Mbol fragments were mixed with 5fl/wHI-digested 
pHT315 that had been treated with calf intestinal, or bacterial, alkaline phosphatase (GibcoBRL, 
Gaithersburg, MD) to remove the 5 '-phosphates from the digested plasmid, preventing re- 
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ligation of the vector to itself. After purification, T4 ligase and a ligation buffer (Promega Corp., 
Madison, WI) were added to the reaction containing the digested vector and the Mbol fragments. 
These were incubated overnight at 15**C, or at room temperature for 1 h, to allow the insertion 
and ligation of the Mbol fragments into the pHT315 vector DNA. 

The ligation mixture was then introduced into transformation-competent E. coli SURE® 
cells (Stratagene Cloning Systems, La JoUa, CA), following procedures described by the 
manufacturer. The transformed E, coli cells were then plated on LB agar plates containing 50-75 
Hg/ml ampicillin and incubated ovenight at The growth of several hundred ampicillin- 

resistant colonies on each plate indicated the presence of recombinant plasmid in the cells of 
each of those colonies. 

To isolate the colonies harboring sequences encoding toxin genes, the colonies were first 
transferred to nitrocellulose filters. This was accomplished by simply placing a circular 
nitrocellulose filter (Millipore HATF 08525, Millipore Corp., Bedford, MA) directly on top of 
the LB-ampicillin agar plates containing the transformed colonies. When the filter was slowly 
peeled off of the plate the colonies stick to the filter giving an exact replica of the pattern of 
colonies from the original plate. Enough cells from each colony were left on the plate that 5 to 
6 h of growth at 37°C restored the colonies. The plates were then stored at 4*^0 until needed. 
The nitrocellulose filters with the transferred colonies are then placed, colony-side up, on fresh 
LB-ampicillin agar plates and allowed to grow at 37°C until they reached an approximate 1 mm 
diameter. 

To release the DNA from the recombinant E. coli cells the nitrocellulose filters were 
placed, colony-side up, on 2-sheets of Whatman 3MM chromatogrphy paper (Whatman 
International Ltd., Maidstone, England) soaked with 0.5 N NaOH, 1.5 M NaCl for 15 min. This 
treatment lysed the cells and denatured the released DNA allowing it to stick to the nitrocellulose 
filter. The filters were then neutralized by placing the filters, colony-side up, on 2 sheets of 
Whatman paper soaked with 1 M NH4-acetate, 0.02 M NaOH for 1 0 min. The filters were rinsed 
in 3X SSC, air dried, and baked for 1 h at 80°C in a vacuum oven. The fihers were then ready 
for use in hybridization studies using probes to identify different classes of B. thuringiensis 
genes, as described in the above examples. 

In order to identify colonies containing cloned cr^Z-related genes, the cry7-specific 
oligonucleotide wd207 was labeled at the 5'-end with [y-^^P]ATP and T4 polynucleotide kinase. 
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The labeled probe was added to the filters in 3X SSC, 0.1% SDS, lOX Denhardt's reagent (0.2% 
BSA, 0.2% polyvinylpyrrolidone, 0.2% FicoU), 0,2 mg/ml heparin and incubated overnight at 
47°C. These conditions allowed hybridization of the labeled oligonucleotide to related 
sequences present on the nitrocellulose blots of the transformed E. coli colonies. Following 
incubation the filters were washed in several changes of 3X SSC, 0.1% SDS at 45X. The filters 
were blotted dry and exposed to Kodak X-OMAT AR X-ray film (Eastman Kodak Co., 
Rochester, NY) overnight at -70X with an intensifying screen. 

Colonies that contain cloned c/yi-related sequences were identified by aligning signals 
on the autoradiogram with the colonies on the original transformation plates. The isolated 
colonies were then grown in LB-ampicillin liquid medium firom which the cells could be 
harvested and recombinant plasmid prepared by the standard alkaline-lysis miniprep procedure 
(Maniatis et al, 1982). The plasmid DNA was then used as a template for DNA sequencing 
reactions necessary to confirm that the cloned gene was novel. If the cloned gene was novel, the 
plasmid was then introduced into a crystal protein-negative strain of A thuringietisis {Cry) so 
that the encoded protein could be expressed and characterized. These procedures are described 
in detail in the following sections. 

6.5 Example 5 - Cloning of Specific Endonuclease Restriction Fragments 

The identification of a specific restriction fragment containing a novel B. thurmgiensis 
gene has been described for cry2-related genes in Section 2. The procedure for cloning a 
restriction fragment of known size was essentially the same as described for cloning an Afbol 
fragment. The DNA was digested with a restriction enzyme (e,g, //mdlll), and run through an 
agarose gel to separate the fragments by size. Fragments of the proper size, identified by 
Southern blot analysis (Example 2), were excised with a razor blade and electroeluted from the 
gel slice into TE buffer firom which they could be precipitated. The isolated restriction fragments 
were then ligated into an E, coli/B, thurmgiensis shuttle vector and transformed into E, coli to 
construct a size-selected library. The library could then be hybridized with a specific gene 
probe, as described in Example 4, to isolate the colony containing the cloned novel gene. 

6.6 Example 6 - Cloning of PCR^-Amplified Fragments 

A rapid method for cloning and expressing novel cry] gene fragments from 
B, thuringiensis was developed using the polymerase chain reaction. Flanking primers were 
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designed to anneal to conserved regions 5' to and within cryl genes. With the exception of 

certain cry3 genes, most B. thuringiensis cry genes are transcriptionally regulated, at least in part, 

by RNA polymerases containing the mother cell-specific or sigE, sigma factor. These a^- 

regulated cry genes possess 5' promoter sequences that are recognized by a^. Alignment of these 

promoter sequences reveals considerable sequence variation, althoi^ a consensus sequence can 

be identified (Baum and Malvar, 1995). A primer, designated "sigE", containing a sequence 

identical to the cryl Ac promoter sequence, was designed that would anneal to related 

promoter sequences 5' to uncharacterized cry genes. The sigE primer also includes a Bbul site 

(isoschizimer: Sphl) to faciUtate cloning of amplified fiagments. The sequence of the sigE 

primer is shown below: 

5'-ATTTAGTAGCATGCGTTGCACTTTGTGCATTTTTTCATAAGATGA 

GTCATATGTTTTAAAT-3' (SEQ ID NO:55). 

The opposing primer, designated KpnR, anneals to a 3'-proximal region of the cryl gene 
that is generally conserved. This primer incorporates an ^5^718 site (isoschizimer: Kpril) 
conserved among the cryJA genes to facilitate cloning of the amplified fragment and to permit 
the construction of fusion proteins containing a carboxyl-terminal portion of the Cryl Ac protein. 
The sequence of the KpnR primer is shown below: 

5'-GGATAGCACTCATCAAAGGTACC-3' (SEQ ID NO:56) 

PCR™s were carried out using a Perkin Elmer DNA thermocycler and the following 
parameters: 94»C, 2 min.; 40 cycles consisting of 94<'C, 30 sec; 40«'C, 2 min; 72''C, 3 min; and a 
10 second extension added to the 72'C incubation after 20 cycles. The standard PGR™ buffer 
(100 nl volume) was modified to include IX Taq Extender buffer, 25 jiM each of the sigE and 
KpnR primers, and 0.5 - 1.0 (il of Taq Extender (Stratagene Inc.) in addition to 0.5 - 1.0 nl of 
Taq polymerase. Typically, 1-2 jil of the DNA preparations firom novel B. thuringiensis isolates 
were included in the PCR™s. PCRt»<s with cry genes incorporating these primers resulted in the 
ampUfication of a ~2.3-kb DNA fragment flanked by restriction sites for Bbul and AspllS. 

For the cloning and expression of these gene fi-agments, the crylAc shuttle vector 
pEG1064 was used. This plasmid is derived from the cry] Ac shuttle vector pEG857 (Baum et 
al., 1990), with the following modifications. A frameshift mutation was generated at a unique 
Ncol site within the cryl Ac coding region by cleaving pEG857 with the restriction endonuclease 
Ncol, blunt-ending the /\fcoI-generated ends with Klenow polymerase and ligating the blunt ends 
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with T4 ligase. In similar fashion, an Asp7\% site located in the multiple cloning site 3* to the 
cry 1 Ac gene was removed, leavmg only the single AspJli site contained within the cry 1 Ac 
coding sequence. The resulting plasmid, pEG1064, cannot direct the production of crystal 
protein when introduced into an acrystalliferous (Cry ) strain of B, thuringiensis because of the 
fiameshift mutatioa For cloning and expression of unknown cry genes, pEG1064 was cleaved 
with Bbul andAsplM and the vector fragment purified following gel electrophoresis. Amplified 
fi^igments of unknown cry genes, obtained by PGR™ amplification of total B. thuringiensis 
DNA, were digested with the restriction endonucleases Bbul and Aspl\% and ligated into the 
Bbul and Aspl\% sites of the pEG1064 vector firagment. The ligation mixture was used to 
transform the Cry' B, thuringiensis strains, EG10368 or EG10650, to chloramphenicol resistance 
using an electroporation protocol previously described (Mettus and Macaluso, 1990) 
Chloramphenicol-resistant (Cm^) isolates were evaluated for crystal protein production by 
phase-contrast microscopy. Crystal forming (Cry-f) isolates were subsequently grown in C2 
liquid broth medium (Donovan et al., 1988) to obtain crystal protein for SDS-PAGE analysis and 
insect bioassay. 

Because of the frameshift mutation within the cry 1 Ac gene, the crystal proteins obtained 
from the transformants could not be derived from the vector pEG1064. The Cry"^ transformants 
thus contained unknown cry gene fragments fused, at the AspllS site, to a 3 '-portion of the 
cry 1 Ac gene. Transcription of these gene fusions in B, thuringiensis was presumably directed 
from the promoter incorporated into the amplified cry gene fragment. The fusion proteins, 
containing the entire active toxin region of the unknown Cry protein, were capable of producing 
crystals in B. thuringiensis, 

6.7 Example 7 - Cloning of cry9-Related Genes 

Total DNA was isolated from B. thuringiensis strain EG9290 for cloning studies. 
EG9290 was grown overnight at SO^'C in IX brain heart infusion, 0.5% glycerol (BHIG). In the 
morning, 500 |il of the overnight growth was suspended in 50 mi BHIG and the culture 
incubated at 30°C with agitation until the culture reached a Klett reading of 150 (red filter). The 
cells were harvested by centrifugation, suspended in 5 ml IX GTE buffer containing 4 mg/ml 
lysozyme and 100 fig/ml Rnase A, and incubated at 37**C for 20 min. The cells were lysed by 
the addition of 0.5 ml of 20% SDS. The released DNA was precipitated by the addition of 2.5 
ml 7.5 M ammonium acetate and 7 ml of isopropanol. The precipitated DNA was spooled out of 



wo 01/19859 



PCT/USOO/25361 



-64- 

the mixture using a glass micropipette and washed in 80% ethanol. The DNA was resuspended 
in 10 ml IX TE, extracted with one volume each of buffered phenol and chioroform:isoamyl 
alcohol (24:1), and precipitated as before. The spooled DNA was washed in 80% ethanol, 
allowed to air dry for several min, and suspended in 600 p.1 IX TE. The DNA concentration was 
estimated at 500 Kig/ml. 

A library of EG9290 total DNA was constructed using partially digested Mbol fragments 
of EG9290 DNA and the general methods described herein. The partial Aibol fragments were 
inserted into the unique BamHl site of cloning vector pHT3 15. The ligation mixture was used to 
transform E. coli Sure™ cells to ampicillin resistance by electroporation employing 
electrocompetent cells and protocols provided by Stratagene (La JoUa, CA) and the BioRad Gene 
Pulser™ apparatus (Bio-Rad Laboratories, Hercules, CA). Recombinant clones harboring cry9- 
type genes were identified by colony blot hybridization using a ^^P-labeled probe consisting of 
the putative cry9C fragment generated by amplification of EG9290 DNA with primers pr58 and 
pr59. Plasmid DNAs were extracted from the E. coli clones using a standard alkaline lysis 
procedure. 

Plasmid DNAs from the E. coli recombinant clones were used to transform 
5. thuringiensis strain EG 10368 to erythromycin resistance using the electroporation procedure 
described by Mettus and Macaluso (1990). Cells were plated onto starch agar plates containing 
20 |ag/ml erythromycin and incubated at 30°C. After six days, colonies with a more opaque 
appearance were recovered from the plates and streaked out onto fresh starch agar plates 
containing 20 ^g/ml erythromycin to isolate single colonies. Colonies exhibiting a more opaque 
appearance were observed to produce large parasporal inclusions/crystals by phase-contrast 
microscopy. 

Recombinant EG 10368 clones producing parasporal inclusion/crystals were evaluated for 
crystal protein production in broth culture. Single colonies were inoculated into C2 medium 
containing 10 ^g/ml erythromycin and grown at 30**C for 3 days at 28-30'*C, at which time the 
cultures were fiiUy sporulated and lysed. Spores and crystals were pelleted by centrifiigation and 
resuspended in 20 mM Tris-HCl, 1 mM EDTA, pH 7.0. Aliquots of this material were analyzed 
by SDS-polyacrylamide gel electrophoresis (SDS-PAGE). Two EG10368 recombinant clones, 
initially identified as 9290-2 and 9290-3, were observed to produce distinct proteins of -130 
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IcDa. 9290-2 was designated EG12102 and 9290-3 was designated EG12103. The EG12102 
protein was designated CryET59 while the EG12103 protein was designated CryET60. 

Plasmid DNAs were prepared from EG12102 and EG12103 using a standard alkaline 
lysis procedure. Digestion of the plasmids with the restriction endonuclease Xbal confirmed that 
the two strains harbored distinct cry genes. The cry plasmids of EG12102 and EG12103, 

designated pEG945 and pEG946, respectively, were used to transform E, coli Sure ' cells to 
ampicillin resistance by electroporation, employing electrocompetent cells and protocols 
provided by Stratagene Inc. The E. coli recombinant strain containing pEG945 was designated 
EG12132, and the E. coli recombinant strain containing pEG946 was designated EG12133. 
pEG945 and pEG946 were purified from the E. coli recombinant strains using the QIAGEN 
midi-column plasmid purification kit and protocols (QIAGEN Inc., Valencia, CA). 

The cryET83 gene was cloned from B, thuringiensis strain EG6346 subspecies aizawai 
using similar methods. Southern blot analysis of genomic DNA from EG6346 revealed a unique 
restriction fragment that hybridized to the cryET59 probe. A series of degenerate 
oligonucleotide primers, pr95, pr97, and pr98, were designed to amplify cryP-related sequences 
from genomic DNA. The sequences of these primers are as shown: 

pr95: 5'- GTWTGGACSCRTCGHGATGTGG -3* (SEQIDNO:57) 
pr97: 5'- TAATTTCTGCTAGCCCWATTTCTGGATTTAATTGTTGATC -3' 
(SEQIDNO:58) 

pr98: 5'- ATWACNCAAMTWCCDTTRG -3' (SEQ ID NO:59) 

where D = A, G; H = A, C, T; M =A, C; N = A, C, G, T; R = A, G; S = C, G; and W = A, T. 

A PGR™ using Taq polymerase, Taq Extender^*^ (Stratagene, La Jolla, CA), the 
opposing primers pr95 and pr97, and total EG6346 DNA yielded a DNA fragment that was 
faintly visible on an ethidium bromide-stained agarose gel. This DNA served as the template for 
a second roxmd of PGR™ using the opposing primers pr97 and pr98. The resulting amplified 
DNA fragment was suitable for cloning and served as a hybridization probe for subsequent 
cloning experiments. A library of EG6346 total DNA was constructed using partially digested 
9-12 kb Mbo\ fragments of EG6346 DNA ligated into the unique BamHl site of cloning vector 
pHT315, E. coli recombinant clones harboring the cryET83 gene were identified by colony blot 
hybridization using the EG6346-specific DNA fragment as a chemiluminescent hybridization 
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probe and the CDP-Star™ nucleic acid chemiluminescent reagent kit from NEN™ Life Science 
Products (Boston, MA) to prepare the hybridization probe. The recombinant plasmid harboring 
the cryET83 gene was designated pEG397. The E. coli recombinant stain containing pEG397 
was designated EG11786. The B. thuringiensis recombinant strain containing pEG397 was 
designated EG 11785. 

6.8 Example 8 ~ Sequencing of Cloned B. thuringiensis Toxin Genes 

Partial sequences for the cloned toxin genes were determined following established 
dideoxy chain-termination DNA sequencing procedures (Sanger et al, 1977). Preparation of the 
double stranded plasmid template DNA was accomplished using a standard alkaline lysis 
procedure or using a QIAGEN plasmid purification kit (QIAGEN Inc., Valencia, CA). The 
sequencmg reactions were performed using the Sequenase™ Version 2.0 DNA Sequencing Kit 
(United States Biochemical/Amersham Life Science Inc., Cleveland. OH) following the 
manufacturer's procedures and using ^*S-dATP as the labeling isotope (obtained from DuPont 
NEN® Research Products, Boston, MA). Denaturing gel electrophoresis of the reactions is done 
on a 6% (wtVvol.) acrylamide, 42% (wt./vol.) urea sequencing gel. The dried gels are exposed to 
Kodak X-OMAT AR X-ray fihn (Eastman Kodak Company, Rochester, NY) overnight at room 
temperature. Alternatively, some cry genes were sequenced using automated sequencing 
methods. DNA samples were sequenced using the ABl PRISM^" DyeDeoxy sequencing 
chemistry kit (Applied Biosystems, Foster City, CA) according to the manufacturer's suggested 
protocol. The completed reactions were run on as ABI 377 automated DNA sequencer. DNA 
sequence data were analyzed using Sequencher^M v3.0 DNA analysis software (Gene Codes 
Corp., Ann Arbor, Ml). Successive oligonucleotides to be used for priming sequencing reactions 
were designed from the sequencing data of the previous set of reactions. 

The sequence determination for the cry/-related genes involved the use of the 
oligonucleotide probe wd207, described in Example 2, as the initial sequencing primer. This 
oligonucleotide anneals to a conserved region of cryl genes, but because of the inverted and 
reversed orientation of wd207, it generates sequence towards the 5'-end of the coding region 
allowing sequence of the variable region of the gene to be read. A typical sequencing run of 
250-300 nucleotides was usually sufficient to determine the identity of the gene. If additional 
data were necessary, one or more additional oligonucleotides could be synthesized to continue 
the sequence until it could be determined if the sequence was unique. In cases where wd207 did 
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not function well as a primer, other oligonucleotides, designed to anneal to conserved regions of 
cryl genes, were used. One such oligonucleotide was the KpnR primer described herein above. 

The sequencing of the cloned cry^-related genes followed the same general procedures as 
those described for the cryl genes, except that oligonucleotides specific for conserved regions in 
cry2 genes were used as sequencing primers. The two primers used in these examples were 
wd268 and wd269, shown below. 

Primer wd268 corresponds to cry2Aa nucleotides 579-597 
5'-AATGCAGATGAATGGGG-3' (SEQ ID NO:60). 
Primer wd269 corresponds to crylAa 1740-1757 
5'-TGATAATGGAGCTCGTT-3' (SEQ ID NO:61) 

The sequencing of cryET59 and cryET60 commenced with the use of primer pr56. The 
sequencing of cryET82 commenced with the use of primer pr98. Successive oligonucleotides to 
be used for priming sequencing reactions were designed from the sequencing data of the 
previous set of reactions. 

The derived sequences were compared to sequences of known cry genes using the 
FSTNSCAN program in the PC/GENE sequence analysis package (Intelligenetics, Mountain 
View, CA). This analysis permitted a preliminary classification of the cloned cry genes with 
respect to previously-known cry genes (Table 1 1). 
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TABLE 6 - HOMOLOGY COMPARISON OF DN A SEQUENCES^ 



Cloned Gene DNA Sequence Identity 

cryETSl 90% identity with SEQ ID NO:4 of WO 98/40490 

cryET40 ^^'^^ identity with crylAa 

cryET43 ^^^^ identity with crylBdJ 

cryET44 ^^^^ identity with crylDa/lDb 

cryET45 ^1% identity with cry I Da/I Db 

cryET46 ^^"^^ identity with cryJGa 

cryET47 ^^^^ identity with crylAb 

cryET49 ^5% identity with cryUa 

cryETS I ^5% identity with cry I Ac 

crvET52 ^^^^^^ identity with cry I Da/I Db 

cLeT53 99% identity with SEQ ID NO:8 of US 5,723,758 

cryET54 ^-^^'^ identity with crylBe 

cryETSe identity with cry I Ac 

cryET57 ^8% identity with crylDa 

cryET59 9^*^** identity with cry9Ca 

cryET60 99.&yo identity with cry9Aa 

cryET6 1 ^'^^^ identity with cry 1 Ha 

cryET62 ^^^^ identity with cry 1 Ad 

crvET63 ^^^^ identity with cry J Ac 

cn;ET64 91% identity with SEQ ID NO:9 of US 5,723,758 

crvET66 76% identity with crylGa 

crvET67 99% identity with SEQ ID NO: 10 of US 5,723.758 

cZeT72 98% identity with SEQ ID NO:4 of WO 98/40490 

ci,ET73 99% identity with SEQ ID NO:6 of WO 98/40490 



cryET83 ; — — ■ 

' Ktup value set at 2 for F STNSCAN. The cryET59 and cryET60 sequences were 

compared using the FASTA program (Ktup=6) in the PC/GENE sequence analysis package. 
6.9 Example 9 - Expression of Cloned Toxin Genes in a B. thuringiensis Host 

Plasmid DNA was isolated from coli colonies identified by hybridization to a gene- 
specific probe. The isolated plasmid was then introduced into a crystal protein-negative (Cry-) 
strain of B. thuringiensis using the electroporation protocol of Mettus and Macaluso (1990). 
Each of the cloning vectors used (see Table 5) has a gene to confer antibiotic resistance on the 
cells harboring that plasmid. B. thuringiensis transformants were selected by growth on agar 
plates containing 25 mg/ml erythromycin (pHT315) or 5 mg/ml chloramphenicol (pEG597 and 
pEG1064). Antibiotic-resistant colonies were then evaluated for crystal protein production by 
phase-contrast microscropy. Crystal producing colonies were then grown in C2 medium 
(Donovan et al. 1988) to obtain cultures which were analyzed by SDS^PAGE and insect 
bioassay. 

C2 cultures were inoculated with cells from Cry^ colonies and grown for three days at 
25-30^C in the presence of the appropriate antibiotic. During this time the culture grew to 
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Stationary phase, sponilated and lysed, releasing the protein inclusions into the medium. The 
cultures are harvested by centrifogation, which pellets the spores and crystals. The pellets were 
washed in a solution of 0.005% Triton X-100®, 2 mM EDTA and centrifiiged again. The 
washed pellets were resuspended at one-tenth the original volume in 0.005% Triton X-100®, 2 
mM EDTA. 

Crystal protein were solubilized from the spores-crystal suspension by incubating the 
suspension in a solubilization buffer [0.14 M Tris-HCl pH 8.0, 2% (wt./vol.) sodium dodecyl 
sulfate (SDS), 5% (vol./vol.) 2-mercaptoethanol, 10% (vol./vol.) glycerol, and 0.1% bromphenol 
blue] at lOO^^C for 5 min. The solubilized crystal proteins were size-fractionated by SDS-PAGE 
using a gel with an acrylamide concentration of 10%. After size fractionation the proteins were 
visualized by staining with Coomassie Brilliant Blue R-250. 

The expected size for Cryl- and Cry9-related crystal proteins was approximately 130 
kDa. The expected size for Cry2-related proteins was approximately 65 kDa. 

6,10 Example 10 - Insecticidal Activity of the Cloned B. ihuringiensis Toxin Genes 

B. ihuringiensis recombinant strains producing individual cloned cry genes were grown 
in C2 medium until the cultures were fully sponilated and lysed. These C2 cultures were used to 
evaluate the insecticidal activity of the crystal proteins produced. Each culture was diluted with 
0.005% Triton® X-100 to achieve the appropriate dilution for two-dose bioassay screens. Fifty 
microliters of each dilution were topically applied to 32 wells containing 1 .0 ml artificial diet per 
well (surface area of 175 mm^). A single lepidopteran larvae was placed in each of the treated 
wells and the tmy was covered by a clear perforated mylar sheet. With the exception of the P. 
xylostella bioassays, that employed 3rd instar larvae, all the bioassays were performed with 
neonate larvae. Larval mortality was scored after 7 days of feeding at 28-30 ""C and percent 
mortality was expressed as ratio of the number of dead larvae to the total number of larvae 
treated (Table 12). In some instances, severe stunting of larval growth was observed after 7 
days, and the ratio of stunted/unstunted larva was also recorded. The bioassay results shown in 
Table 7 demonstrate that the crystal proteins produced by the recombinant B. thuringiensis 
strains do exhibit insecticidal activity and, finthermore. 
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Spodoptera exigua 



Tabic 7A. Bioassay evaluations with ET crystal proteins 

Spodoptera frugiperda 







2500nl/weU 


# stunted 




o/g. mortalitv 

/O lllVllallijf 


% mortality 


/# treated 


Cry 1 Ac 


0 


c 


4/32 


ET31 


5 




17/32 


ET40 


0 




n 
u 


ET43 


0 


8 


U 


ET44 


0 


2 


0 


ET45 


0 


0 


u 


ET46 


0 


12 


U 


ET47 


19 


49 


11/32 


ET49 


0 


8 


0 


ET51 


0 


0 


0 


ET52 


0 


0 


0 


ET53 


0 


0 


0 


ET54 


0 


66 


3/32 


ET56 


0 


0 


0 


ET57 


2 


15 


18/32 


ET59 


0 


0 


0 


ET60 


0 


0 


0 


ET61 


2 


5 


2/32 


ET62 


2 


59 


12/32 


ET63 


0 


12 


5/32 


ET64 


0 


0 


0 


ET66 


0 


12 


1/32 


ET67 


29 


90 


0 


ET72 


0 


0 


0 


ET73 


0 


2 


0 


Control 


8 


8 


0 



250nl/well 
% mortality 



16 
9 

3 
3 
6 
0 
0 
31 
0 
0 
3 
3 
6 
0 
3 
0 
0 
0 
0 
3 
3 
3 
13 
3 
0 
0 



2500nl/well 
% mortality 



53 
6 
3 
3 
0 
0 
6 
81 
3 
0 
3 
0 

34 
6 

94 
3 
3 
3 
13 
0 
6 
0 
61 
94 
0 
0 



# stunted 
/# treated 



1/32 
4/32 

0 
2/32 
1/32 
1/32 

0 
6/32 

0 

0 

0 

0 
9/32 

0 

0 

0 

0 

0 

0 

0 

0 
1/31 

0 
5/31 

0 

0 
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Table 7B. Bioassay evaluations with ET crystal proteins 
Plutella xyhstella Ostrinia nubiiaiis 





250nl/well 


2500n]/well 


# stunted 


250nl/well 


2500nl/well 


U stunted 




% mortality 


% mortality 


m treated 


% mortality 


% mortality 


/U treated 


Cry I Ac 


100 


100 


0 


100 


100 


0 


ET31 


0 


2 


0 


100 


100 


0 


ET40 


0 


68 


0 


0 


0 


2/32 


ET43 


5 


100 


0 


46 


100 


0 


ET44 


0 


0 


0 


0 


0 


3/32 


ET45 


0 


0 


0 


0 


0 


4/32 


ET46 


0 


8 


0 


0 


0 


0 


ET47 


100 


too 


0 


100 


100 


0 


ET49 


0 


5 


0 


0 


0 


0 


ET5I 


0 


0 


0 


0 


0 


0 


ET52 


2 


43 


0 


0 


14 


16/32 


ET53 


8 


97 


0 


4 


46 


5/32 


ET54 


14 


100 


0 


25 


89 


1/32 


ET56 


0 


0 


0 


0 


0 


0 


ET57 


0 


97 


0 


0 


7 


0 


ET59 


100 


100 


0 


96 


100 


0 


ET60 


100 


100 


0 


100 


96 


0 


ET61 


0 


11 


0 


0 


0 


2/32 


ET62 


97 


100 


0 


100 


100 


0 


ET63 


100 


too 


0 


100 


100 


0 


ET64 


40 


100 


0 


68 


100 


0 


ET66 


100 


100 


0 


86 


100 


0 


ET67 


87 


100 


0 


0 


79 


1/32 


ET72 


0 


0 


0 


0 


0 


0 


ET73 


2 


2 


0 


93 


100 


0 


Control 


2 


2 


0 


0 


0 


0 
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Tablc 7C. Bioassay evaluations with ET crystal proteins 



Heliothis virescens Helicoverpa zea 





250nl/well 


2500nl/well 


# stunted 


23unvweii 


Z^UUnl/Weil 




% mortality 


% mortality 


tlx - ^ - -1 

m treated 


Vo mortal ny 


/o monauiy 


CrylAc 


100 


100 


0 


100 


100 


ET31 


97 


97 


1/32 


8 


81 


ET40 


2 


5 


2/32 


2 


5 


ET43 


87 


97 


1/32 


0 


2 


ET44 


8 


5 


1/32 


5 


8 


ET45 


0 


11 


0 


8 


18 


ET46 


12 


25 


0 


0 


8 


ET47 


87 


100 


0 


83 


100 


ET49 


8 


2 


0 


n 


15 


ET51 


2 


15 


0 


5 


5 


ET52 


0 


31 






]] 


ET53 


22 


64 


2/32 


90 


61 


ET54 


15 


64 


5/32 


2 


5 


ET56 


0 


11 


0 


8 


0 


ET57 


2 


0 


0 


11 


28 


ET59 


28 


84 


4/32 


2 


2 


ET60 


56 


97 


1/32 


31 


28 


ET61 


5 


5 


0 


8 


5 


ET62 


44 


87 


4/3i2 


21 


64 


ET63 


100 


100 


0 


100 


100 


ET64 


0 


21 


0 


5 


0 


ET66 


0 


8 


1/32 


0 


5 


ET67 


18 


93 


1/32 


0 


68 


ET72 


34 


64 


11/32 


8 


2 


ET73 


42 


90 


2/32 


8 


48 


Control 


5 


5 


0 


5 


5 
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Table 7D. Bioassay evaluations with ET crystal proteins 



Agrotis ipsiloh Trichoplusia ni 





^JMllU well 


x^vviii/ well 


TT SLUIllCU 


i^tij 1/111/ well 


^j\j\fitu well 


ft aiuiiicu 




/O iiitii iaiiijr 


fO iiiuiiaiiijr 


treated 


mArfnlftv 

lllvl icillljr 


^/fk mArtalitv 
/o iiii/i iciiiijr 


/fr ucdtevj 


v-ry i/\c 


Oil 


inn 

lUU 




lUU 


1 AA 
lUU 


A 

U 


HI 31 


O 


0 




OA 


1 AA 


A 

u 




/\ 
U 


0 




11 
1 J 




A 

u 




u 


Vic 
43 




1 AA 


1 AA 
lUU 


A 

u 


Jb 144 


D 


1 1 




lO 


2o 


A 

u 


1 143 


A 


0 




13 


39 


A 
U 


br4o 


0 


0 




OA 


74 


A 


tI47 


U 


34 




97 


1 A A 


A 

0 


ET49 


3 


0 




13 


81 


0 




0 


() 






10 


A 

\J 


ET52 


0 


28 




81 


100 


0 


ET53 


25 


81 




74 


100 


0 


ET54 


3 


6 




100 


100 


0 


ET56 


3 


3 




16 


26 


0 


ET57 


13 


74 




19 


100 


0 


ET59 


3 


3 




10 


84 


0 


ET60 


3 


0 




97 


100 


0 


ET61 


6 


28 




29 


52 


0 


ET62 


23 


58 




100 


100 


0 


ET63 


3 


0 




100 


100 


0 


ET64 


0 


0 




87 


100 


0 


ET66 


13 


91 




26 


81 


0 


ET67 


3 


0 




6 


100 


0 


ET72 


0 


0 




23 


74 


8/32 


ET73 


13 


6 




94 


100 


0 


Control 


0 


0 




3 


3 


0 



that the crystal proteins exhibit differential activity towards the lepidopteran species tested. 

Additional bioassays were performed with the crystal proteins designated CryET59, 
CryET60, CryET66, and CryET83. Crystal proteins produced in C2 medium were quantified 
by SDS-PAGE and densitometry using the method described by Brussock, S. M. and Currier, T. 
C, 1990, "Use of Sodium Dodecyl Sulfate-Polyacrylamide Gel Electrophoresis to Quantify 
Bacillus thuringiensis 8-Endotoxins", in Analytical Chemistry of Bacillus thuringiensis (L. A. 
Hickle and W. L. Fitch, eds.). The American Chemical Society, pp. 78-87. 
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Table 8 - Bioassay Evaluation of CryET59 and CryET60 





Percent mortality 



Toxin 



Control^ 



CryET59 
CryET59 
CryET59 
CryET60 
CryET60 



TN 



D5Si AI HV HZ ON fx rPX SE 

ng/Well rr- 

2 6 0 0 2 0 2 0 



100 2 37 0 94 100 100 2 13 

500 11 80 3 100 100 100 0 63 
5000 62 100 6 100 100 100 71 100 
500 0 93 22 100 100 100 0 100 

. 5000 2 100 25 100 100 100 14 I00_ 

' Al = Agrotis ipsilon, H V = HeliMhis virescens. HZ = Helicoverpa zea, ON = Ostrmia 
nubilalis. PX = Plutella xylostella, rPX = Plutella xylostella colony resistant to Cry 1 A and 
Cry IF toxins, SE = Spodoptera exigua. TN = Trichc^lusia ni. 
^Control °* no toxin added. 

The procedure was modified to eliminate the neutralization step with 3M HEPES. 
Crystal proteins resolved by SDS-PAGE were quantified by densitometry using a Molecular 
Dynamics model 300A computing densitometer and purified bovine serum albumin (Pierce, 
Rockford, IL) as a standard. 

The bioassay results shown in Table 8 demonstrate that CryET59 and CryET60 are toxic 
to a number of lepidopteian species, including a colony of P. xylosiella that is resistant to Cryl A 
and CrylF crystal proteins. Eight-dose assays with CryET66 also demonstrated excellent 
toxicity towards both the susceptible and resistant colonies of P. xylostella (Table 14). In this 
instance, eight crystal protein concentrations were prepared by serial dilution of the crystal 
protein suspensions in 0.005% Triton® X-100 and 50 ul of each concentration was topically 
applied to wells containing 1.0 ml of artificial diet. After the wells had dried, a single larvae was 
placed in each of the treated wells and the tray was covered by a clear perforated mylar sheet (32 
larvae for each crystal protein concentration). Larval mortality was scored after 7 days of 
feeding at 28-30 °C. Mortality data was expressed as LC50 and LC95 values, the concentration of 
crystal protein (ng/175 mm^ diet well) causing 50% and 95% mortality, respectively (Daum, 
1970). 



Toxin 


LC50' 


95% C.I. 


LC95' 


Slope 


-Cryl Ac 
CrylC 
CryET66 


8.05 

25.06 

0.42 


5.0-15.2 

15.7-40.6 

0.4-0.5 


52.94 

117.07 

1.4 


2.01 
2.46 
3.13 
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Toxicity of CryET66 towards Cry 1 A*resistant Fluleila xylosteiia 



Toxin 


LCso' 


95% CI. 




Slope 


Cry 1 Ac 


*No significant mortality 

> 






CrylC 


27.32 


15.4-51.1 


156.13 


2.17 


CryET66 


1.65 


1.3-2.0 


6.41 


2.79 



' the concentration of crystal protein, in nanograms of crystal protein per well, required to achieve 50% mortality 



^ the concentration of crystal protein, in nanograms of crystal protein per well, required to achieve 95% mortality. 

Table 15 shows that the CryET83 protein exhibits toxicity towards a wide variety of 
lepidopteran pests and may exhibit improved toxicity towards S. exigua and K virescens when 
compared to the other Cry9-type proteins CryET59 and CryET60. 



Table 10 - Toxicity of CryET83 towards lepidopteran larvae^ 



Dose^ 




HV 


HZ 


ON 


PX 


SE 


SF 


TN 


5 










5 








10 








9 










50 




53 






75 






69 


100 








91 










500 


0 


100 








67 




100 


5000 


32 










100 






10000 






84 








100 





Toxicity calculated as percent mortality among treated larvae. 
^ ng CryET83 crystal protein/ 1 75 mm^ diet well 
^ Abbreviations described in Table 8; SF = Spodoptera frugiperda 



The recombinant A thuringiertsis strains listed in Table 5 were deposited with the ARS Patent Culture Collection 
and had been assigned the NRRL deposit numbers shown in Table 11. 



Table]]. Biological Deposits 



Polypeptide 


Polypeptide 


Polynucleotide 


Recomb. 


NRRL Deposit 


Designation 


Seq. iO No.: 


Seq ID No.: 


Strain 


No.: 


Cry ET31 


2 


1 


EG11562 


B-21921 


Cry ET40 


4 


3 


EG11901 


B-21922 


Cry ET43 


6 


5 


EG7692 


B-21923 


Cry ET44 


8 


7 


EG 11629 


8-21924 


Cry ET45 


10 


9 


EG7694 


B-21925 


Cry ET46 


12 


11 


EG7695 


B-21926 


Cry ET47 


14 


13 


EG7696 


B-21927 


Cry ET49 


16 


15 


EG11630 


B-21928 


Cry ET51 


18 


17 


EG11921 


B-21929 


Cry ET52 


20 


19 


EG11584 


B-21930 


Cry ET53 


22 


21 


EG11906 


8-21931 
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Polypeptide Polypeptide Polynucleotide 
Designation Seq. ID No.: Seq ID No.; 



Recomb. 
Strain 



NRRL Deposit 
No.: 



Cry ET54 
Cry ET56 
Cry ET57 
Cry ET59 
Cry ET60 
Cry ET61 
Cry ET62 
Cry ET63 
Cry ET64 
Cry ET66 
Cry ET67 
Cry ET68 
CryET72 
Cry ET73 
CryET83 



63 
24 
26 
28 
30 
32 
34 
36 
38 
40 
42 
44 
46 
48 
50 



62 
23 
25 
27 
29 
31 
33 
35 
37 
39 
41 
43 
45 
47 
49 



EG11907 
EG11909 
E611910 
EG12102 
EG12103 
EG11634 
EG11635 
EG11636 
EG11638 
EG11640 
EG11642 
EG11644 
EG11440 
EG11465 
EG11785 



B-21932 
B-21933 
B-21934 
B-21935 
B-21936 
B-21937 
B-21938 
B-21939 
B-21940 
B-21941 
B-21942 
B-30137 
B-21943 
B-21944 
8-30138 



6.11 Example 11 - Modification of cry Genes for Expression in Plants 

WUd-type cry genes are known to be expressed poorly in plants as a full length gene or as 
a truncated gene. Typically, the G+C content of a cry gene is low (37%) and often contains 
many A+T rich regions, potential polyadenylation sites and numerous ATTTA sequences. Table 
12 shows a list of potential polyadenylation sequences which should be avoided when preparing 
the "plantized" gene construct. 
Table 12 - LIST OF SEQUENCES OF POTENTIAL POLYADENYLATION SIGNALS 



AATAAA* 


AAGCAT 


AATAAT* 


ATTAAT 


AACCAA 


ATACAT 


ATATAA 


AAAATA 


AATCAA 


ATTAAA** 


ATACTA 


AATTAA** 


ATAAAA 


AATACA** 


ATGAAA 


CATAAA** 



\ potential major plant polyadenylation site, 
indicates a potential minor animal polyadenylation site. 
All others are potential minor plant polyadenylation sites. 
The regions for mutagenesis may be selected in the following maimer. All regions of the 
DNA sequence of the cry gene are identified which contained five or more consecutive base 
pairs which were A or T. These were ranked in terms of length and highest percentage of A+T in 
the surrounding sequence over a 20-30 base pair region. The DNA is analysed for regions which 
might contain polyadenylation sites or ATTTA sequences. Oligonucleotides are then designed 
which maximize the elimination of A+T consecutive regions which contained one or more 
polyadenylation sites or ATTTA sequences. Two potential plant polyadenylation sites have been 
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shown to be more critical based on published reports. Codons are selected which increase G+C 
content, but do not generate restriction sites for enzymes useful for cloning and assembly of the 
modified gene (e.g., BamHh Bglll, Sad, Ncol, EcoRW, etc). Likewise condons are avoided 
which contain the doublets TA or GC which have been reported to be infrequently-found codons 
s in plants. 

Although the CaMV35S promoter is generally a high level constitutive promoter in most 
plant tissues, the expression level of genes driven the CaMV35S promoter is low in floral tissue 
relative to the levels seen in leaf tissue. Because the economically important targets damaged by 
some insects are the floral parts or derived from floral parts (e.g., cotton squares and bolls, 

10 tobacco buds, tomato buds and fruit), it is often advantageous to increase the expression of 
crystal proteins in these tissues over that obtained with the CaMV35S promoter. 

The 35S promoter of Figwort Mosaic Virus (FMV) is analogous to the CaMV35S 
promoter. This promoter has been isolated and engineered into a plant transformation vector. 
Relative to the CaMV promoter, the FMV 35S promoter is highly expressed in the floral tissue, 

15 while still providing similar high levels of gene expression in other tissues such as leaf A plant 
transformation vector, may be constructed in which the full length synthetic cry gene is driven by 
the FMV 35S promoter. Tobacco plants may be transformed with the vector and compared for 
expression of the crystal protein by Western blot or ELISA immunoassay in leaf and floral 
tissue. The FMV promoter has been used to produce relatively high levels of crystal protein in 

20 floral tissue compared to the CaMV promoter. 

6.12 Example 12 - Expression of Synthetic ay Genes with ssRUBISCO Promoters and 
Chioroplast Transit Peptides 

The genes in plants encoding the small subunit of RUBISCO (SSU) are often highly 
expressed, light regulated and sometimes show tissue specificity. These expression properties 

25 are largely due to the promoter sequences of these genes. It has been possible to use SSU 
promoters to express heterologous genes in transformed plants. Typically a plant will contain 
multiple SSU genes, and the expression levels and tissue specificity of different SSU genes will 
be different. The SSU proteins are encoded in the nucleus and synthesized in the cytoplasm as 
precursors that contain an N-terminal extension known as the chioroplast transit peptide (CTP). 

30 The CTP directs the precursor to the chioroplast and promotes the uptake of the SSU protein into 
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thechlotoplast. In this process, the CTP is cleaved from the SSU protein. These CTP sequences 
have been used to direct heterologous proteins into chloroplasts of transformed plants. 

The SSU promoters might have several advantages for expression of heterologous genes 
in plants. Some SSU promoters are very highly expressed and could give rise to expression 
levels as high or higher than those observed with the CaMV35S promoter. The tissue 
distribution of expression from SSU promoters is different from that of the CaMV35S promoter, 
so for control of some insect pests, it may be advantageous to direct the expression of crystal 
proteins to those cells in which SSU is most highly expressed. For example, although relatively 
constitutive, in the leaf the CaMV35S promoter is more highly expressed in vascular tissue than 
in some other parts of the leaf, while most SSU promoters are most highly expressed in the 
mesophyll cells of the leaf. Some SSU promoters also are more highly tissue specific, so it could 
be possible to utilize a specific SSU promoter to express the protein of the present invention in 
only a subset of plant tissues, if for example expression of such a protein in certain cells was 
found to be deleterious to those cells. For example, for control of Colorado potato beetle in 
potato, it may be advantageous to use SSU promoters to direct crystal protein expression to the 

leaves but not to the edible tubers. 

Utilizing SSU CTP sequences to localize crystal proteins to the chloroplast might also be 
advantageous. Localization of the B. thuringiemis crystal proteins to the chloroplast could 
protect these from proteases found in the cytoplasm. TTiis could stabilize the proteins and lead to 
higher levels of accumulation of active toxin, cry genes containing the CTP may be used in 
combination with the SSU promoter or with other promoters such as CaMV35S. 
6.13 Example 13 ~ Targeting of Cry Proteins to the ExtraceUular Space or Vacuole 
through the Use of Signal Peptides 

The B. thuringiensis proteins produced from the synthetic genes described here are 
localized to the cytoplasm of the plant cell, and this cytoplasmic localization results in plants that 
are insecticidally effective. It may be advantageous for some purposes to direct the 
B. thuringiensis proteins to other compartments of the plant cell. Localizing B. thuringiensis 
proteins in compartments other than the cytoplasm may result in less exposure of the 
B. thuringiensis proteins to cytoplasmic proteases leading to greater accumulation of the protein 
yielding enhanced insecticidal activity. Extracellular localization could lead to more efficient 
exposure of certain insects to the B. thuringiensis proteins leading to greater efficacy. If a 
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B. ihuringiensis protein were found to be deleterious to plant cell function, then localization to a 
noncytoplasmic compartment could protect these cells from the protein. 

In plants as well as other eukaryotes» proteins that are destined to be localized either 
extracellularly or in several specific compartments are typically synthesized with an N-terminal 
5 amino acid extension known as the signal peptide. This signal peptide directs the protein to enter 
the compartmentalization pathway, and it is typically cleaved from the mature protein as an early 
step in compartmentalization. For an extracellular protein, the secretory pathway typically 
involves cotranslational insertion into the endoplasmic reticulum with cleavage of the signal 
peptide occurring at this stage. The mature protein then passes through the Golgi body into 
10 vesicles that fuse with the plasma membrane thus releasing the protein into the extracellular 
space. Proteins destined for other compartments follow a similar pathway. For example, 
proteins that are destined for the endoplasmic reticulum or the Golgi body follow this scheme, 
but they are specifically retained in the appropriate compartment. In plants, some proteins are 
also targeted to the vacuole, another membrane bound compartment in the cytoplasm of many 

15 plant cells. Vacuole targeted proteins diverge from the above pathway at the Golgi body where 
they enter vesicles that fuse with the vacuole. 

A common feature of this protein targeting is the signal peptide that initiates the 
compartmentalization process. Fusing a signal peptide to a protein will in many cases lead to the 
targeting of that protein to the endoplasmic reticulum. The efficiency of this step may depend on 

20 the sequence of the mature protein itself as well. The signals that direct a protein to a specific 
compartment rather than to the extracellular space are not as clearly defined. It appears that 
many of the signals that direct the protein to specific compartments are contained within the 
amino acid sequence of the mature protein. This has been shown for some vacuole targeted 
proteins, but it is not yet possible to define these sequences precisely. It appears that secretion 

25 into the extracellular space is the "default" pathway for a protein that contains a signal sequence 
but no other compartmentalization signals. Thus, a strategy to direct B. thuringiensis proteins out 
of the cytoplasm is to fuse the genes for synthetic 5. thuringiensis genes to DNA sequences 
encoding known plant signal peptides. These fusion genes will give rise to B. thuringiensis 
proteins that enter the secretory pathway, and lead to extracellular secretion or targeting to the 

30 vacuole or other compartments. 
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Signal sequences for several plant genes have been described. One such sequence is for 
the tobacco pathogenesis related protein PR lb has been previously described (Comelissen et al, 
1986). The PRlb protein is normally localized to the extracellular space. Another type of signal 
peptide is contained on seed storage proteins of legumes. These proteins are localized to the 
protein body of seeds, which is a vacuole like compartment found in seeds. A signal peptide 
DNA sequence for the p-subimit of the 7S storage protein of conmion bean (Phaseolus vulgaris), 
PvuB has been described (Doyle et al.y 1986). Based on the published these published 
sequences, genes may be synthesized chemically using oligonucleotides that encode the signal 
peptides for PRlb and PvuB. In some cases to achieve secretion or compartmentalization of 
heterologous proteins, it may be necessary to include some amino acid sequence beyond the 
normal cleavage site of the signal peptide. This may be necessary to insure proper cleavage of 
the signal peptide. 

6.14 Example 14 Isolation of Transgenic Plants Resistant to Insects Using cry 
Transgenes 

6.64.1 Plant Gene Construction 

The expression of a plant gene which exists in double-stranded DNA fomi involves 
transcription of messenger RNA (mRNA) from one strand of the DNA by RNA polymerase 
enzyme, and the subsequent processing of the mRNA primary transcript inside the nucleus. This 
processing involves a 3' non-translated region which adds polyadenylate nucleotides to the 3' 
end of the RNA. Transcription of DNA into mRNA is regulated by a region of DNA usually 
referred to as the "promoter". The promoter region contains a sequence of bases that signals 
RNA polymerase to associate with the DNA and to initiate the transcription of mRNA xising one 
of the DNA strands as a template to make a corresponding strand of RNA. 

A number of promoters which are active in plant cells have been described in the 
literature. Such promoters may be obtained from plants or plant viruses and include, but are not 
limited to, the nopaline synthase (NOS) and octopine synthase (OCS) promoters (which are 
carried on tumor-inducing plasmids of Agrobacterium tumefaciens), the cauliflower mosaic virus 
(CaMV) 19S and 35S promoters, the light-inducible promoter from the small subunit of ribulose 
1,5-bisphosphate carboxylase (ssRUBISCO, a very abundant plant polypeptide), and the Figwort 
Mosaic Virus (FMV) 35S promoter. All of these promoters have been used to create various 
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types of DNA constructs which have been expressed in plants (see eg., U. S. Patent No. 
5,463,175, specifically incorporated herein by reference). 

The particular promoter selected should be capable of causing sufiBcient expression of the 
enzyme coding sequence to result in the production of an effective amount of protein. One set 
of preferred promoters are constitutive promoters such as the CaMV35S or FMV35S promoters 
that yield high levels of expression in most plant organs (U. S. Patent No. 5,378,619, specifically 
incorporated herein by reference). Another set of preferred promoters are root enhanced or 
specific promoters such as the CaNfV derived 4 as-1 promoter or the "wheat POXl promoter (U. 
S. Patent No. 5,023,179, specifically incorporated herein by reference; Hertig et al^ 1991). The 
root enhanced or specific promoters would be particularly preferred for the control of com 
rootworm {Diabroticus spp.) in transgenic com plants. 

The promoters used in the DNA constructs {i.e. chimeric plant genes) of the present 
invention may be modified, if desired, to affect their control characteristics. For example, the 
CaMV35S promoter may be ligated to the portion of the ssRUBISCO gene that represses the 
expression of ssRUBISCO in the absence of light, to create a promoter which is active in leaves 
but not in roots. The resulting chimeric promoter may be used as described herein. For purposes 
of this description, the phrase "CaMV35S" promoter thus includes variations of CaMV35S 
promoter, e.g., promoters derived by means of ligation with operator regions, random or 
controlled mutagenesis, etc. Furthermore, the promoters may be altered to contain multiple 
'^enhancer sequences" to assist in elevating gene expression. 

The RNA produced by a DNA constmct of the present invention also contains a 5' non- 
translated leader sequence. This sequence can be derived from the promoter selected to express 
the gene, and can be specifically modified so as to increase translation of the mRNA. The 5' 
non-translated regions can also be obtained firom viral RNA's, fi-om suitable eucaryotic genes, or 
fi-om a synthetic gene sequence. The present invention is not limited to constructs wherein the 
non-translated region is derived fix)m the 5' non-translated sequence that accompanies the 
promoter sequence. 

For optimized expression in monocotyledenous plants such as maize, an intron should 
2ilso be included in the DNA expression construct. This intron would typically be placed near 
the 5' end of the mRNA in xintranslated sequence. This intron could be obtained from, but not 
limited to, a set of introns consisting of the maize hsp70 intron (U. S. Patent No. 5,424,412; 
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specifically incorporated herein by reference) or the rice Actl intron (McElroy et al., 1990). As 
shown below, the maize hsp70 intron is useful in the present invention. 

As noted above, the 3' non-translated region of the chimeric plant genes of the present 
invention contains a polyadenylation signal which functions in plants to cause die addition of 
adenylate nucleotides to the 3' end of the RNA. Examples of preferred 3' regions are (1) the 3' 
transcribed, non-translated regions containing the polyadenylate signal of Agrobacterium tumor- 
inducing (Ti) plasmid genes, such as the nopaline synthase (NOS) gene and (2) plant genes such 
as the pea ssRUBISCO E9 gene (FischhofF et al, 1987). 

6.14.2 Plant Transformation and Expression 

A plant gene containing a structural coding sequence of the present invention can be 
inserted into the genome of a plant by any suitable method. Suitable plant transformation vectors 
include those derived ftom a Ti plasmid of Agrobacterium tumefaciens, as well as diose 
disclosed, e.g., by Herrera-Estrella (1983), Bevan (1983), Klee (1985) and Eur. Pat Appl. Publ. 
No. EP0120516. In addition to plant transformation vectors derived from the Ti or toot-inducing 
(Ri) plasmids of Agrobacterium, alternative methods can be used to insert the DNA constructs of 
this invention into plant cells. Such methods may involve, for example, the use of liposomes, 
electroporation, chemicals that increase free DNA uptake, free DNA delivery via microprojectile 
bombardment, and transformation using viruses or pollen (Fromm et al, 1986; Armstrong et al, 
1 990; Fromm et ai , 1 990). 

6.143 Construction of Monocot Plant Expression Vectors for cry Genes 

For efiBcient expression of cry genes in transgenic plants, the gene must have a suitable 
sequence composition (Diehn et al, 1996). To place the cry gene in a vector suitable for 
expression in monocotyledonous plants {i.e. under control of the enhanced Cauliflower Mosaic 
Virus 35S promoter and link to the hsp70 intron followed by a nopaline synthase 
polyadenylation site as in U. S. Patent No. 5,424,412, specifically incorporated herein by 
reference), a vector such as pMON 19469 may be used. Such a vector is conveniently digested 
with Ncol and £coRI restriction enzymes. The larger vector band of approximately 4.6 kb is 
then electrophoresed, purified, and ligated with T4 DNA ligase to an Nco\-Eco^ fragment 
which contains the synthetic cry gene. The ligation mix is then transformed into E. coli, 
carbenicillin resistant colonies recovered and plasmid DNA recovered by DNA miniprep 
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procedures. The DNA is then subjected to restriction endonuclease analysis with enzymes such 
as Ncol and EcoU (together), M>/I, and/or Pstl individually or in combination, to identify clones 
containing the cry coding sequence fused to an intron such as the hsp70 intron, placed under the 
control of the enhanced CaMV35S promoter. 

To place the gene in a vector suitable for recovery of stably transformed and insect 
resistant plants, the 3.75-kb Nod restriction fiagment from pMON33708 containing the lysine 
oxidase coding sequence fiised to the hsp70 intron under control of the enhanced CaMV35S 
promoter may be isolated by gel electrophoresis and purification. This fragment is then ligated 
with a vector such as pMON30460 which has been previously treated with Notl and calf 
intestinal alkaline phosphatase {pMON30460 contains the neomycin phosphotransferase coding 
sequence under control of the CaMV35S promoter). Kanamycin resistant colonies may then be 
obtained by transformation of this ligation mix into E. coli and colonies containing the desired 
plasmid may be identified by restriction endonuclease digestion of plasmid miniprep DNAs. 
Restriction enzymes such as Notl, EcoRY, //mdlll, Ncol, EcoRly and Bglll may be used to 
identify the appropriate clones in which the orientation of both genes are in tandem (J.e. the 3' 
end of the cry expression cassette is linked to the 5' end of the nptll expression cassette). 
Expression of the Cry protein by the resulting plasmid in com protoplasts may be confirmed by 
electroporation of the vector DNA into protoplasts followed by protein blot and ELISA analysis. 
This vector may be introduced into the genomic DNA of com embryos by particle gun 
bombardment followed by paromomycin selection to obtain com plants expressing the cry gene 
essentially as described in U. S. Patent No. 5,424,412, specifically incorporated herein by 
reference. 

As an example, the vector may be introduced via cobombardment with a hygromycin 
resistance conferring plasmid into immature embryo scutella (lES) of maize, followed by 
hygromycin selection, and regeneration. Transgenic com lines expressing the cry protein may 
then be identified by ELISA analysis. Progeny seed from these events may then be subsequently 
tested for protection from insect feeding. 
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The following references, to the extent that they provide exemplary procedural or other 
details supplementary to those set forth herein, are specifically incorporated herein by reference. 



wo 01/19859 



PCT/USOO/25361 



-84- 

U. S. Patent 4,196,265, issued Apr. 1, 1980. 
U. S. Patent 4,554,101, issued Nov. 19, 1985. 
U. S. Patent 4,683,195, issued Jul. 28, 1987. 
U. S. Patent 4.683,202, issued Jul. 28, 1987. 
5 U. S. Patent 4,757,011, issued Jul. 12, 1988. 
U. S. Patent 4,769,061, issued Sep. 6, 1988. 

U. S. Patent 4,940,835, issued Feb. 23, 1990. 

U. S. Patent 4,965,188, issued Oct. 23, 1990. 

U. S. Patent 4,971,908, issued Nov. 20, 1990. 
10 U. S. Patent 4,987,071 , issued Jan. 22, 1991 . 

U. S. Patent 5,023,179, issued Jun 11, 1991. 

U. S. Patent 5,176,995, issued Oct. 15, 1991. 

U. S. Patent 5,334,71 1, issued Aug. 2, 1994. 

U. S. Patent 5,378,619, issued Jan 3, 1995. 
15 U. S. Patent 5,384,253, issued Jan. 24, 1995. 

U. S. Patent 5,424,412, issued Jun 13, 1995. 

U. S. Patent 5,463,175, issued Oct 31, 1995. 

U. S. Patent 5,631,359, issued May 20, 1997. 

Int. Pat. Appl. Publ. No. WO 84/02913. 
20 Int. Pat. Appl. Publ. No. WO 91/03162. 

Int. Pat. Appl. Publ. No. WO 92/07065. 

Int. Pat. Appl. Publ. No. WO 93/15187! 

Int. Pat. Appl. Publ. No. WO 93/23569. 

Int. Pat. Appl. Publ. No. WO 94/02595. 
25 Int. Pat Appl. Publ. No. WO 94/1 3688. 

Eur. Pat. Appl. Publ. No. EP0120516. 

Eur. Pat. Appl. Publ. No. EP0360257. 

Eur. Pat. Appl. Publ. No. 921 10298.4 

30 Arantes and Lereclus, Gene. 108:1 15-119, 1991. 
Abdullah et ai. Biotechnology, 4:1087, 1986. 



wo 01/19859 



PCT/USOO/25361 



-85- 

Baum et ai.Appl Environ, MicrobioLy 56:3420-3428, 1990. 

Benbrook et aL, In: Proceedings Bio Expo 1986, Butterworth, Stoneham, MA, pp. 27-54, 1986. 
Bevan et aL, Nucleic Acids Res., ll(2):369-85, 1983. 
Bytebierer a/.,-Prac. NatL Acad Sci, USA, 84:5345, 1987. 
Callis et aL, Genes and Development, 1 :1 183, 1987. 

Campbell, "Monoclonal Antibody Technology, Laboratory Techniques in Biochemistry and 
Molecular Biology," Vol. 13, Burden and Von Knippenberg, Eds. pp. 75-83, Elsevier, 
Amsterdam, 1984. 

Capecchi, "High efficiency transformation by direct microinjection of DNA into cultured 

mammalian cells," Cell, 22(2):479-488, 1980. 
Cashmore et ai. Gen. Eng. of Plants, Plenum Press, New York, 29-38, 1983. 
Charles et aL, Annu. Rev. EntomoL, 41 :45 1-472, 1996. 
Chau et al , Science, 244: 1 74-1 8 1 , 1 989. 
Chen et al,Nucl. Acids Res., 20:4581-9, 1992. 
Chowrira and Burke, Nucl Acids Res., 20:2835-2840, 1992. 

Clapp, "Somatic gene therapy into hematopoietic cells. Current status and future implications," 

Clin. Perinatol., 20(1):155-168, 1993. 
Collins and Olive, Biochem,, 32:2795-2799, 1993. 

Conway and Wickens, In: UNA Processing, p. 40, Cold Spring Harbor Laboratory, Cold Spring 

Harbor, NY, 1988. 
Comelissene/a/., Nature, 321(6069):531-2, 1986. 
Crickmore et al., Microbiol. Mol. Biol. Rev. 62:807-813, 1998. 
Cristou etal. Plant Physiol, 87:671-674, 1988. 

Curiel, Agarwal, Wagner, Cotten, "Adenovirus enhancement of transferrin-polylysine-mediated 
gene delivery," Proc. Natl Acad Sci. USA, 88(19):8850.8854, 1991. 

Curiel, Wagner, Cotten, Bimstiel, Agarwal, Li, Loechel, Hu, "High-efficiency gene transfer 
mediated by adenovirus coupled to DNA-poly lysine complexes," Hum, Gen. Ther., 
3(2):147.154, 1992. 

Dean et al, Nucl Acids Res., 14(5):2229, 1986. 



wo 01/19859 



PCT/USOO/25361 



-86- 

Dhir, S. K., Dhir, S., Hepburn, A., and Widholm, J. M., "Factors affecting transient gene 
expression in electroporated Glycine-max protoplasts," Plant Cell Rep., 10(2): 106- 110, 
1991. 

Dhir, S. K., Dhir, S., Sturtevant, A. P., and Widholm, J. M., "Regeneration of transformed shoots 
for electroporated soybean Glycine-max L. Merr. Protoplasts, Plant Cell Rep., 10(2):97- 
101, 1991. 

Diehn et al. Genet. Eng. (N.Y.). 18:83-99, 1996. 
Donovan etal. J. Biol. Chem. 263:561-567, 1988. 
Donovan etal. Appl. Environ. Microbiol 58:3921-3927, 1992. 
Y>oy\c etal. J. Biol Chem.. 261(20):9228-38, 1986. 
Dropulic et al, J. Virol, 66:1432-41, 1992. 

Eglitis and Anderson, "Retroviral vectors for introduction of genes into mammalian cells," 

Biotechniques, 6(7):608-614, 1988. 
Eglitis, Kantoff, Kohn, Karson, Moen, Lothrop, Blaese, Anderson, "Retroviral-mediated gene 

transfer into hemopoietic cells," Avd. Exp. Med. Biol, 241:19-27, 1988. 
Elroy-Stein and Moss, Proc. Natl Acad. Set USA, 87:6743-7, 1990. 
English and Slatin, Insect Biochem. Mol Biol, 22:1-7, 1992. 
Fraley et al. Biotechnology, 3:629, 1985. 
Fraley et al , Proc. Natl Acad. Set USA, 80:4803, 1 983. 
FTomm etal. Biotechnology (N.Y.), 8(9):833-9, 1990. 
Fromm et al , Nature, 319:791 -793, 1 986. 

Fromm, Taylor, Walbot, "Expression of genes transferred into monocot and dicot plant cells by 

electroporation," Proc. Natl Acad. Sci. USA, 82(17):5824-5828, 1985. 
Fujimura el al. Plant Tiss. Cult. Lett., 2:74, 1985. 

Fynan, Webster, Fuller, Haynes, Santoro, Robinson, "DNA vaccines: protective immunizations 
by parenteral, mucosal, and gene gun inoculations," Proc. Natl Acad Scl USA, 
90(24): 11478-1 1482, 1993. 

Gao and Huang, Nucl Acids Res., 21:2867-72, 1993. 

Gefter et al , Somat. Cell Genet., 3 :23 1 -236, 1 977. 

Genovese and Milcarek, In: RNA Processing, p. 62, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY, 1988. 



wo 01/19859 



PCTAJSOO/25361 



-87- 

Gil and Proudfoot, Nature, 312:473, 1984. 

Coding, "Monoclonal Antibodies: Principles and Practice," pp. 60-74. 2nd Edition, Academic 

Press, Orlando, FL, 1986. 
Graham and van der Eb, "Transformation of rat cells by DNA of human adenovirus 5," Virology, 

54(2):536-539, 1973. 
Guerrier-Takada et al.. Cell, 35:849, 1983. 
Hampel and Tritz, Blochem., 28:4929, 1989. 
Hampel et al. , Nucl. Acids Res. , 1 8:299, 1 990. 

Harlow and Lane, "Antibodies: A Laboratory Manual," Cold Spring Harbor Laboratory, Cold 

Spring Harbor, NY, 1988. 
Herrera-Estrellaera/., Embo. J.. 2(6):987-996, 1983. 
Hertige/fl/., Plant Mol. Biol, 16(l):171-4, 1991. 
Hess, Intern Rev. Cytol, 107:367, 1987. 
Hofte et al , Microbiol Rev., 53 :242-255, 1 989. 

Horsch, R. B., Fry, J. E., Hoffinann, N. L., Eichholtz, D., Rogers, S. G., and Fraley, R. T., "A 
simple and general method for transferring genes into plants," Science, 227(469 1): 1229- 
1231, 1985. 

Jameson and Wolf, "The Antigenic Index: A Novel Algorithm for Predicting Antigenic 

Determinants," Compu. Appl. Biosci., 4(l):181-6, 1988. 
Johnston and Tang, "Gene gun transfection of animal cells and genetic immunization," Methods 

Cell Biol. 43(A):353-365, 1994. 
Jorgensen et al., Mol. Gen. Genet., 207:471, 1987. 
Kashani-Sabet et al. , Antisense Res. Dev. , 2:3-1 5, 1 992. 
Keller et al., EMBO J., 8:1309-14, 1989. 

Klee, H. J., Yanofsky, M. F., and Nester, E. W., "Vectors for transformation of higher plants," 

Bio-Technology, 3(7):637-642, 1985. 
Klein et al.. Nature. 327:70, 1987. . 

Klein et al., Proc. Natl Acad Sci. USA, 85:8502-8505, 1988. 
Kohler and Milstein, Eur. J. Immunol., 6:51 1-519, 1976. 
Kohler and Milstein, Nature, 256:495-497, 1975. 
Kreig et al.. In: Zangew. Era., 96:500-508, 1983. 



wo 01/19859 



PCTAJSOO/25361 



-88- 

Kyte and Doolittle, A simple method for displaying the hydropathic character of a protein," J. 

Mol Biol., 157(1):105-132, 1982. 
Lambert etal.,Appl. Environ. Microbiol., 58:2536-2642, 1992B. 
Lambert et al. Gene, 1 10:131-132, 1992A. 
Langridge etal, Proc. Natl. Acad. Sci. USA, 86:3219-3223, 1989. 
Lee era/., Biochem. Biophys. Res. Comm. 229:139-146. 
L'Huilliere/a/.,^A/50y., n:44Il-8, 1992. 
Lieber et ai , Methods Enzymol , 2 1 7:47-66, 1 993. 
Lindstrom et al.. Developmental Genetics, 1 1:160, 1990. 
Usaevncz etal., Proc. Natl Acad. Sci. U.S.A., 90:8000-4, 1993. 
LoTzetal.,Mol Gen. Genet., 199:178, 1985. 

Lu, Xiao, Clapp, Li, Broxmeyer, "High efficiency retroviral mediated gene transduction into 
single isolated immature and replatable CD34(3+) hematopoietic stem/progenitor cells 
from human umbilical cord blood," J. Exp. Med., 178(6):2089-2096, 1993. 

Luo et al.. Plant Mol. Biol Reporter, 6:165, 1988. 

Maddock et al.. Third International Congress of Plant Molecular Biology, Abstract 372, 1991. 
Maloy et al., "Microbial Genetics" 2nd Edition. Jones and Barlett Publishers, Boston, MA, 1994. 
Maloy, S.R., "Experimental Techniques in Bacterial Genetics" Jones and Bartlett Publishers, 
Boston, MA, 1990. 

Maniatis et al, "Molecular Cloning: a Laboratory Manual," Cold Spring Harbor Laboratory, 

Cold Spring Harbor, NY., 1982. 
Marcotte et ai , Nature, 335:454, 1 988. 
McCabe et al. Biotechnology, 6:923, 1988. 
McDevitt et al. Cell, 37:993-999, 1984. 

McElroy, Zhang, Wu, "Isolation of an efficient promoter for use in rice transformation," Plant 

Cg//, 2:163-171, 1990. 
Mettus and Macaluso, zip/?/. Environ. Microbiol 56:1128-1134, 1990 
Neuhaus et al, Theor. Appl Genet., 75:30, 1987. 
Odell etal. Nature, 313:810, 1985. 

Ohkawa, Yuyama, Taira, "Activities of HIV-RNA targeted ribozymes transcribed from a 'shot- 
gun' type ribozyme-trinuning piasmid," Nwl Acids Symp. Ser., 27:15-6, 1992. 



wo 01/19859 



PCTAJSOO/25361 



-89- 

Ojwang et al, Proc. Natl Acad. Set USA, 89:10802-6, 1992. 
OminiUeh et al.. Plant Mol. Biol., 21 :415-428, 1993. 

Pandey and Marziuflf, In "RNA Processing," p. 133, Cold Spring Harbor Laboratory, Cold 

Spring Harbor, NY, 1987. 
Pena etal.. Nature, 325:274, 1987. 
Penault et al. Nature, 344:565, 1990. 
Perrotta and Been, Biochem., 31:16, 1992. 
Pieken etal. Science, 253:314, 1991. 
Poszkowski etal, EMBOJ., 3:2719, 1989. 
Potrykuse/ o/.,M?/. Gen. Genet., 199:183, 1985. 
Poulsen er a/.. Mo/. Gen Genet., 205:193-200, 1986. 
Prokop and Bajpai, , Ann. N. Y. Acad. Sci., 646, 1 991 . 

Rogers et al. In: Methods For Plant Molecular Biology, A. Weissbach and H. Weissbach, eds.. 

Academic Press Inc., San Diego, CA 1988. 
Rogers etal. Methods Enzymol, 153:253-277, 1987. 
Rossi et al,Aids Res. Hum. Retrovir., 8:183, 1992. 
Sadofsky and Alwine, Mol Cell Biol, 4(8): 1460- 1468, 1984. 

Sambrook et al, "Antibodies: A Laboratory Manual," Cold Spring Harbor Laboratory, Cold 

spring Harbor, NY, 1989. 
Sambrook et al, "Molecular Cloning: A Laboratory Manual," Cold Spring Harbor Laboratory, 

Cold Spring Harbor, NY, 1 989. 
Sanger et al, "DNA sequencing with chain-terminating inhibitors," Proc. Natl Acad. ScL USA, 

74(12):5463-5467, 1977. 
Sarvere/a/.. Science, 247(4947): 1222-5, 1990. 
Saville and Collins, Cell, 61 :685-696, 1990. 

Saville and Collins, Proc. Natl Acad. ScL USA, 88:8826-8830, 1991. 
Scanlon et al, Proc. Natl Acad ScL USA, 88:10591-5, 1991. 
Scaringee/a/.,iV«<:/. Acids Res., 18:5433-5441, 1990. 
Schnepf er al, Microbiol Mol Biol Rev. 62:775-806, 1998. 
Shaw and Kamen, Cell, 46:659-667, 1986. 



wo 01/19859 



PCTAJSOO/25361 



-90- 

Shaw and Kamen, In: "RNA Processing", p. 220, Cold Spring Harbor Laboratory, Cold Spring 

Harbor, NY, 1987. 
Sick etaI.,NucL Acids Res., 18:1305, 1990. 
Simpson, Science, 233:34, 1986. 
Spielmann etai, Mol. Gen. Genet., 205:34, 1986. 
T3aiaetaI.,Nucl. Acids Res., 19:5125-30, 1991. 
Toriyama etai, TheorAppl. Genet., 73:16, 1986. 
Uchimiyae/a/.,M?/. Gen. Genet., 204:204, 1986. 
UsmaaetaL,J. Am. Chem. Soc, 109:7845-7854, 1987. 
UsmanandCedergren, TIBS, 17:34, 1992. 
Van Tunen et al.. EMBOJ.. 7:1257, 1988. 

Vasil et al., "Herbicide-resistant fertile transgenic plants obtained by microprojectile 

bombardment ofregenerableembryogenic callus," 5io/cc/iMo/ogy, 10:667-674, 1992. 
Vasil, Biotechnology, 6:397, 1 988. 
Ventura et al., Nucl. Acids Res., 21 :3249-55, 1993. 
Vodkmetal.,Cell. 34:1023, 1983. 

Vogel, J. M., Dawe, R. K., and Freeling, M., "Regulation of the cell type-specific expression of 
maize Adhl and Shi electroporation-directed gene transfer into protoplasts of several 
maize tissues," y. Cell. Biochem., (Suppl. 0) 13:PartD, 1989. 

Von Tersch, M. A., Robbins, H. L., Jany, C. S., and Johnson, T., Appl Environ. Microbiol. 
57:349-358,1991. 

Wagner et al, "Coupling of adenovirus to transferrin-polylysine/DNA complexes greatly 
enhances receptor-mediated gene delivery and expression of transfected genes," Proc. 
Natl. Acad Sci. USA, 89(13):6099-6103, 1992. 

Weerasingheera/.,J. K/ro/., 65:5531-4, 1991. 

Weissbach and Weissbach, Methods for Plant Molecular Biology, (eds.). Academic Press, Inc., 

San Diego, CA, 1988. 
"Weraier etai.. Plant Mol. Biol., 12:41-50, 1989. 
Wickens and Stephenson, Science, 226:1045, 1984. 

Wickens et al.. In: "RNA Processing," p. 9, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, 1987. 



wo 01/19859 



PCT/USOO/25361 



-91- 

Widner, W. R., and Whiteley, H. R., J. Bacteriol, 172:2826-2832, 1990. 

Wong and Neumann, "Electric field mediated gene transfer," Biochim. Biophys, Res. Commun,^ 

107(2):584-587, 1982. 
Woolf e/a/., Proc. Natl Acad, ScL USA, 89:7305-7309, 1992. 
Yamada et aL, Plant Cell Rep., 4:85, 1986. 
Yang et al, Proc. Natl Acad. ScL USA, 87:4144-48, 1990. 
Yu et ai, Proc, Natl Acad. Set USA, 90:6340-4, 1993. 

Zatloukal, Wagner, Gotten, Phillips, Plank, Steinlein, Curiel, Bimstiel, "Transfeirinfection: a 
highly efficient way to express gene constructs in eukaryotic cells," Ann, N Y. Acad Scl, 
660:136-153, 1992. 
Zhou et al. Methods EnzymoL, 101 :433, 1983. 
Zhou etal,Mol Cell Biol, 10:4529-37, 1990. 

All of the compositions and methods disclosed and claimed herein can be made and 
executed without imdue experimentation in light of the present disclosure. While the 
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CLAIMS: 

1 . An isolated polypeptide at least 85% identical to SEQ ID NO: 1 8, SEQ ID NO:20, SEQ ID 
NO:24, SEQ ID NO:40, or SEQ ID NO:44. 

2. An isolated polypeptide at least 91% identical to SEQ ID N0:2, SEQ ID N0:6, SEQ ID 
NO:8, SEQ ID NO:10, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:24, SEQ ID NO:38, 
SEQ ID NO:40, or SEQ ID NO:44. 

3. An isolated polypeptide at least 95% identical to SEQ ID NO:2, SEQ ID NO:6, SEQ ID 
NO:8, SEQ ID NO:10, SEQ ID NO:16, SEQ ID N0:18, SEQ ID NO:20, SEQ ID NO:22. 
SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ 
ID NO:44, or SEQ ID NO:50. 

4. An isolated polypeptide at least 99% identical to SEQ ID NO:2. SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14. SEQ ID NO: 16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ 
ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID 
NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44. SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO:50 or SEQ ID NO: 63. 

5. The poljTJeptide of claim 4, comprising an amino acid sequence selected from the groiq> 
consisting of SEQ ID N0:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID N0:8, SEQ ID 
NO:10, SEQ ID NO:12, SEQ ID N0:14, SEQ ID N0:16, SEQ ID NO:18, SEQ ID 
NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID N0:26, SEQ ID NO:28, SEQ ID 
NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 
NO:50 and SEQ ID NO: 63. 

6. An isolated nucleic acid sequence encoding the polypeptide of any preceding claim. 

7. A composition comprising the polypeptide of any of claims 1 to 5, and a diluent. 

8. The composition of claim 7, wherein the polypeptide is selected from the group consisting 
of SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID 
NO:12, SEQ ID NO:14, SEQ ID N0:16, SEQ ID N0:18, SEQ ID NO:20. SEQ ID 
NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 



wo 01/19859 



PCT/USOO/25361 



-93- 

NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID 
NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50 and SEQ ID 
NO: 63. 

9. The composition of claim 7, comprising a cell extract, cell suspension, cell homogenate, 
cell lysate, cell supernatant, cell filtrate, or cell pellet of Bacillus thuringiensis cells, 

10. The composition of claim 7, wherein said composition is a powder, dust, pellet, granule, 
spray, emulsion, colloid, or solution. 

1 1 . The composition of claim 7, comprising from about 1% to about 99% by weight of said 
polypeptide. 

12. An insecticidal polypeptide prepared by a process comprising the steps of: 

(a) culturing a Bacillus thuringiensis cell having the accession number NRRL B- 

21784, NRRL B-21783, NRRL B-21917, NRRL B-21786, NRRL B-21787, NRRL B- 

21785, NRRL B-21788, NRRL B-21915 or NRRL B-21916 under conditions effective to 
produce an insecticidal polypeptide; and 

(b) obtaining from said cell the insecticidal polypeptide so produced. 

13. A Bacillus thuringiensis cell having the NRRL accession number NRRL B-21784, 
NRRL B-21783, NRRL B-21917, NRRL B-21786, NRRL B-21787, NRRL B-21785, 
NRRL B-21788, NRRL B-21915 or NRRL B-21916. 

14. An isolated polynucleotide at least 85% identical to SEQ ID NO:17, SEQ ID NO:19, SEQ 
ID NO:23, SEQ ID NO:39, or SEQ ID NO:43. 

1 5. The polynucleotide of claim 1 5, wherein the polynucleotide is at least 95% identical to SEQ 
ID NO:l, SEQ ID N0:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 15, SEQ ID NO: 17, 
SEQ ID NO: 19, SEQ ID N0:21, SEQ ID NO:23, SEQ ID NO:27, SEQ ID NO:35, SEQ 
ID NO:37, SEQ ID NO:39, SEQ ID NO:43, or SEQ ID NO:49. 

1 6. The polynucleotide of claim 1 5, wherein the polynucleotide is at least 99% identical to SEQ 
ID NO:l, SEQ ID N0:3, SEQ ID N0:5, SEQ ID NO:7, SEQ ID N0:9, SEQ ID NO:l 1, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ 
ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID N0:31, SEQ ID 
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NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID N0:4l, SEQ ID 
NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, or SEQ ID NO:62. 

17. The polynucleotide of claim 15, comprising the nucleic acid sequence of SEQ ID N0:1, 
SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID 

5 N0:13. SEQ ID NO:15, SEQ ID N0:17, SEQ ID NO:19, SEQ ID NO:21. SEQ ID 

NO:23. SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:35. SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41. SEQ ID 
NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, or SEQ ID NO:62. 

18. The polynucleotide of any of claims 14-17, wherein the isolated polynucleotide is 
10 provided in a vector. 

19. The polynucleotide of any of claims 14-17, wherein the isolated polynucleotide is 
operably linked to a promoter. 

20. The polynucleotide of claim 1 9, wherein the promoter is a plant-expressible promoter. 

21. The polynucleotide of claim 20, wherein the plant-expressible promoter is selected from 
15 the group consisting of com sucrose synthetase 1, com alcohol dehydrogenase 1, com 

light harvesting complex, com heat shock protein, pea small subunit RuBP carboxylase, 
Ti plasmid mannopme synthase, Ti plasmid nopaline synthase, petunia chalcone 
isomerase, bean glycine rich protein 1, Potato patatin, lectin, CaMV 35S, and the S-E9 
small subunit RuBP carboxylase promoter. 
20 22. The polynucleotide of claim 18, M^erein the vector is a plasmid, baculovims, artificial 
chromosome, virion, cosmid, phagemid, phage or viral vector. 

23. A transforaied host cell comprising a nucleic acid sequence encoding the polypeptide of any 
of claims 1 to S. 

24. The transformed host cell of claim 23, wherein the nucleic acid is selected from the group 
25 consisting of SEQ ID NO: 1 . SEQ ID N0:3, SEQ ID NO:5, SEQ ID N0:7, SEQ ID NO:9, 

SEQ ID NO:ll, SEQ ID NO:13, SEQ ID N0:15, SEQ ID NO:17, SEQ ID N0:19, SEQ 
ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27. SEQ ID NO:29, SEQ ID 
NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID 
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NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, and SEQ ID 
NO:62. 

25. The transformed host cell of claim 23, further defined as a prokaryotic or eukaryotic host 
cell. 

26. The transformed host cell of claim 23, further defined as a bacterial cell or a plant cell. 

27. The transformed host cell of claim 26, wherein said bacterial cell is a Bacillus thuringiensis^ 
Bacillus subtiliSy Bacillus megaterium. Bacillus cereus, Escherichia, Salmonella, 
Agrobacterium or Pseudomonas cell. 

28. The transformed host cell of claim 26, wherein said bacterial cell is a Bacillus thuringiemis 
NRRL 8-21784, NRRL B-21783, NRRL B-21917, NRRL B-21786, NRRL B-21787, 
NRRL B-21785, NRRL B-21788. NRRL 6-21915 or NRRL B-2I916 ceU. 

29. The transformed host cell of claim 27, wherein said bacterial cell is an Agrobacterium 
tumefaciens cell. 

30. The transformed host cell of claim 26, further defined as a monocotyledonous or 
dicotyledonous plant cell. 

31. The transformed host cell of claim 30, wherein said plant cell is selected from the group 
consisting of a com, wheat, soybean, oat, cotton, rice, rye, sorghum, sugarcane, tomato, 
tobacco, kapok, flax, potato, barley, turf grass, pasture grass, berry, fruit, legume, vegetable, 
ornamental plant, shrub, cactus, succulent, and tree cell. 

32. The transformed host cell of claim 30, wherein said plant cell is a com, wheat, rice, or 
sugarcane cell. 

33. The transformed host cell of claim 30, wherein said plant cell is a soybean, cotton, potato, 
tomato, or tobacco cell. 

34. A transgenic plant having incorporated into its genome a selected polynucleotide comprismg 
a first sequence region that encodes the polypeptide of any of claims 1 to 5. 

35. The transgenic plant of claim 34, wherein said first sequence region encodes SEQ ID N0:2, 
SEQ ID NO:4, SEQ ID N0:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
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NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30. SEQ ID NO:32. SEQ ID 
NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42. SEQ ID 
NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, or SEQ ID NO: 63. 

36. The transgenic plant of claim 34, wherein said first sequence region comprises SEQ ID 
NO:l, SEQ ID NO:3, SEQ IDNO:5, SEQ ID N0:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ 
ID NO:13, SEQ ID NO:15. SEQ ID NO:17, SEQ ID NO:19. SEQ ID NO:21, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:27. SEQ ID NO:29, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:4I. SEQ ID 
NO:43. SEQ ID NO:45. SEQ ID NO:47, SEQ ID NO:49, or SEQ ID NO:62. 

37. The transgenic plant of claim 34, further defined as a monocotyledonous plant. 

38. The transgenic plant of claim 34, further defined as a com. wheat, oat, rice, barley, turf 
grass, or pasture grass plant 

39. The transgenic plant of claim 34, further defined as a dicotyledonous plant 

40. The transgenic plant of claim 34, further defmed as a legume, soybean, tobacco, tomato, 
potato, cotton, fruit, berry, vegetable or tree. 

41. A progeny of any generation of the transgenic plant of claim 34, wherein said progeny 
comprises said first selected sequence region. 

42. A seed of any generation of the plant of claim 34, wherein said seed comprises said first 
sequence region. 

43. A seed of any generation of the progeny of claim 39, wherein said seed comprises said first 
sequence region. 

44. A plant of any generation of the seed of claim 42 or 43, wherein said plant comprises said 
first sequence region. 

45. A method for controlling Lepdopteran insects comprising contacting said insect with the 
polypeptide of any of claims 1 to 5. 

46. The method of claim 45, wherein the polypeptide is provided in a powder, dust, pellet, 
granule, spray, emulsion, colloid, or solution. 

47. The method of claim 45, wherein the polypeptide is provided in a transformed host cell. 
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48. The method of claim 47, wherem the transformed host cell is a bacterial or plant cell. 

49. The method of claim 45, wherein the polypeptide is provided in a transgenic plant. 

50. The method of claim 49, wherein the plant is a com, cotton, or soybean plant. 

51. A method of preparing an insect resistant plant comprising: 

(a) contacting recipient plant cells with a polynucleotide composition 
comprising at least a first nucleic acid sequence encoding the polypeptide of any 
of claims 1 to 5; 

(b) selecting a recipient plant cell comprising the first nucleic acid sequence; 
and 

(c) regenerating a plant from the selected cell; 

wherein said plant has enhanced insect resistance relative to the corresponding ] 
transformed plant. 
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8.0 Seqpience Listing 

SEQtXENCE LISTING 

<110> Baum, James A. 
Chu, Chih-Rei 
Donovan, William P. 
Gilmer, Amy J. 
Rupar, Mark J . 

<120> Lepidopteran-Active Bacillus thuringiensis 

Delta-Endotoxin Compositions and Methods of Use 

<130> meco201 

<140> 
<141> 

<160> 63 

<170> Patentin Ver. 2.1 



<210> 1 
<211> 1899 
25 <212> DNA 

<213> Bacillus thuringiensis 

<220> 
<221> CDS 
30 <222> (1) . . (1899) 



40 



45 



50 



<400> 1 4. 
atg aat aat gta tta aat aac gga aga act act att tgt gat gcg tat 
Met Asn Asn Val Leu Asn Asn Gly Arg Thr Thr He Cys Asp Ala Tyr 

10 15 



35 1 5 



aat gta gtg gcc cat gat cca ttt agt ttt gag cat aaa tea tta gat 
Asn val val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asp 
20 25 30 

acc ate cga aaa gaa tgg atg gag tgg aaa aga aca gat cat agt tta 
Thr He Arg Lys Glu Trp Met Glu Trp Lys Arg Thr Asp His Ser Leu 
35 40 45 

tat gta get cct ata gte gga act gtt tct age ttt etg eta aag aag 
Tvr Val Ala Pro He Val Gly Thr Val Ser Ser Phe Leu Leu Lys Lys 
50 55 60 

gtg ggg agt ett att gga aaa agg ata ttg agt gaa tta tgg ggg tta 
Val Gly Ser Leu He Gly Lys Arg He Leu Ser Glu Leu Trp Gly Leu 
65 70 75 80 

ata ttt cct agt ggt age aca aat eta atg caa gat att tta agg gag 
He Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp He Leu Arg Glu 
55 8 5 • 90 95 

aea gaa caa tte eta aat caa aga ctt aat aca gae act ett gcc cgt 



48 



96 



144 



192 



240 



288 



336 
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Thr Glu Gin Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Ala Arg 
100 105 110 

gta aat gcg gaa ttg gaa ggg ctg caa gcg aat ata agg gag ttt aat 384 
Val Asn Ala Glu Leu Glu Gly Leu Gin Ala Asn lie Arg Glu Phe Asn 
115 120 125 

caa caa gta gat aat ttt tta aat cct act caa aac cct gtt cct tta 432 
Gin Gin Val Asp Asn Phe Leu Asn Pro Thr Gin Asn Pro Val Pro Leu 
130 135 140 

tea ata act tct tea gtt aat aca atg cag caa tta ttt eta aat aga 480 
Ser lie Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn Arg 
145 150 155 160 

tta ccc cag ttc cgt gtg caa gga tac caa ctg tta tta tta cct tta 528 
Leu Pro Gin Phe Arg Val Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 
165 170 175 

ttt gca cag gca gee aat atg cat ctt tct ttt att aga gat gtt gtt 576 
Phe Ala Gin Ala Ala Asn Met His Leu Ser Phe lie Arg Asp Val Val 
180 185 190 

etc aat gca gat gaa tgg gga att tea gca gca aca tta cgt acg tat 624 
Leu Asn Ala Asp Glu Trp Gly lie Ser Ala Ala Thr Leu Arg Thr Tyr 
195 200 205 

caa aat tat ctg aaa aat tat aca aca gag tac tct aat tat tgt ata 672 
Gin Asn Tyr Leu Lys Asn Tyr Thr Thr Glu Tyr Ser Asn Tyr Cys He 
210 215 220 

aat acg tat caa act gcg ttt aga ggt tta aac ace cgt tta cae gat 720 
Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
225 230 235 240 

atg tta gaa ttt aga aca tat atg ttt tta aat gta ttt gaa tat gta 768 
Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 
245 250 255 

tct ate tgg teg ttg ttt aaa tat caa age ctt eta gta tct tct gge 816 
Ser lie Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 
260 265 270 

get aat tta tat gca age ggt agt gga cea cag cag act caa tea ttt 864 
Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 
275 280 285 

act tea caa gac tgg cca ttt tta tat tct ctt ttc caa gtt aat tea 912 
Thr Ser Gin Asp Trp^ Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 
290 295 300 

aat tat gtg tta aat ggc ttt agt ggc get aga ctt acg cag act ttc 960 
Asn Tyr Val Leu Asn Gly Phe Ser Gly Ala Arg Leu Thr Gin Thr Phe 
305 310 315 320 

cct aat att ggt ggt tta cct ggt act act aca act cae gca ttg ctt 1008 
Pro Asn lie Gly Gly Leu Pro Gly Thr Thr Thr Thr His Ala Leu Leu 
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325 330 335 

gcg gca agg gtc aat tac agt gga gga gtt teg tct ggt gat ata ggc 1056 
Ala Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Asp lie Gly 
5 340 345 350 

get gtg ttt aat caa aat ttt agt tgt age aca ttt etc eea cct ttg 1104 
Ala Val Phe Asn Gin Asn Phe Ser Cys Ser Thr Phe Leu Pro Pro Leu 
355 360 365 

10 

tta aca cca ttt gtt agg agt tgg eta gat tea ggt tea gat cga ggg 1152 
Leu Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Ser Asp Arg Gly 
370 375 380 



) 15 ggt gtt aat ace gtt aea aat tgg caa aca gaa teg ttt gag tea act 
Gly Val Asn Thr Val Thr Asn Trp Gin Thr Glu Ser Phe Glu Ser Thr 
385 390 395 400 



35 tct gtg cat aat aga aaa aat aat ate tat gee gtg cat gaa aat ggt 
' Ser Val His Asn Arg Lys Asn Asn He Tyr Ala Val His Glu Asn Gly 
465 470 475 480 

act atg att eat tta gcg ccg gaa gat tat aea gga ttc ace ata teg 
40 Thr Met He His Leu Ala Pro Glu Asp Tyr Thr Gly Phe Thr He Ser 

485 490 495 



55 



tat tta aga gta tct tea eta gga aat tee act att cga gtt act ata 
Tyr Leu Arg Val Ser Ser Leu Gly Asn Ser Thr lie Arg Val Thr He 
545 550 555 560 



1200 



tta ggt tta agg tgt ggt get ttt aca get egt ggt aat tea aac tat 1248 
20 Leu Gly Leu Arg . Cys Gly Ala Phe Thr Ala Arg Gly Asn Ser Asn Tyr 

405 410 415 

ttc cca gat tat ttt ate cgt aat att tea gga gtt cct tta gtt gtt 1296 
Phe Pro Asp Tyr Phe He Arg Asn He Ser Gly Val Pro Leu Val Val 
25 420 425 430 

aga aat gaa gat tta aga aga ccg tta eac tat aat gaa ata aga aat 1344 

Arg Asn Glu Asp Leu Arg Arg Pro Leu His Tyr Asn Glu He Arg Asn 
435 440 445 

30 

ata gaa agt cct tea gga aca cct ggt gga tta cga get tat atg gta 1392 

He Glu Ser Pro Ser Gly Thr Pro Gly Gly Leu Arg Ala Tyr Met Val 
450 455 460 



1440 



1488 



ccg ata eat gca act caa gtg aat aat caa aeg cga aca ttt att tct 1536 

Pro He His Ala Thr Gin Val Asn Asn Gin Thr Arg Thr Phe He Ser 
45 500 505 510 

gaa aaa ttt gga aat caa ggt gat tec tta aga ttt gaa caa age aac 1584 

Glu Lys Phe Gly Asn Gin Gly Asp Ser Leu Arg Phe Glu Gin Ser Asn 

515 520 525 

50 

aeg aca gca cgt tat aca ctt aga gga aat gga aat agt tac aat ctt 1632 

Thr Thr Ala Arg Tyr Thr Leu Arg Gly Asn Gly Asn Ser Tyr Asn Leu 
530 535 540 



1680 
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aac ggt agg gtt tat act get tea aat gtt aat aet act aca aat aae 1728 
Asn Gly Arg Val Tyr Thr Ala Ser Asn Val Asn Thr Thr Thr Asn Asn 
565 570 575 

5 

gat gga gtt aat gat aat gge get cgt ttt tta gat att aat atg ggt 1776 
Asp Gly Val Asn Asp Asn Gly Ala Arg Phe Leu Asp He Asn Met Gly 
580 585 590 

10 aat gta gta gca agt gat aat act aat gta ccg tta gat ata aat gtg 1824 
Asn Val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp He Asn Val 
595 600 605 

aca ttt aae tec ggt act caa ttt gag ctt atg aat att atg ttt gtt 1872 
IS Thr Phe Asn Ser Gly Thr Gin Phe Glu Leu Met Asn lie Met Phe Val 
610 615 620 

cca act aat ctt cca cca ata tat taa 1899 
Pro Thr Asn Leu Pro Pro He Tyr 
20 62 5 630 



<210> 2 

<211> 632 

25 <212> PRT 

<213> Bacillus thuringiensis. 

<400> 2 

Met Asn Asn Val Leu Asn Asn Gly Arg Thr Thr lie Cys Asp Ala Tyr 
30 1 5 10 15 

Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asp 

20 25 30 

Thr He Arg Lys Glu Trp Met Glu Trp Lys Arg Thr Asp His Ser Leu 
35 40 ' 45 

35 Tyr Val Ala Pro He Val Gly Thr Val Ser Ser Phe Leu Leu Lys Lys 
50 55 60 

Val Gly Ser Leu He Gly Lys Arg He Leu Ser Glu Leu Trp Gly Leu 
65 70 75 80 

He Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp He Leu Arg Glu 
40 85 90 95 

Thr Glu Gin Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Ala Arg 

100 105 110 

Val Asn Ala Glu Leu Glu Gly Leu Gin Ala Asn He Arg Glu Phe Asn 
115 120 125 

45 Gin Gin Val Asp Asn Phe Leu Asn Pro Thr Gin Asn Pro Val Pro Leu 
130 135 140 

Ser He Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn Arg 
145 150 155 160 

Leu Pro Gin Phe Arg Val Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 
50 165 170 175 

Phe Ala Gin Ala Ala Asn Met His Leu Ser Phe He Arg Asp Val Val 

180 185 190 

Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr Tyr 
195 200 205 

55 Gin Asn Tyr Leu Lys Asn Tyr Thr Thr Glu Tyr Ser Asn Tyr Cys He 
210 215 220 

Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 



wo 01/19859 



PCTAJSOO/25361 



10 



-5 



225 230 235 240 

Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 

245 250 255 

Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 

260 265 270 

Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 

275 280 285 

Thr Ser Gin Asp Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 

290 ' 295 300 

Asn Tyr Val Leu Asn Gly Phe Ser Gly Ala Arg Leu Thr Gin Thr Phe 
305 310 315 320 

Pro Asn He Gly Gly Leu Pro Gly Thr Thr Thr Thr His Ala Leu Leu 

325 330 335 

Ala Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Asp He Gly 
)l5 340 345 350 

Ala Val Phe Asn Gin Asn Phe Ser Cys Ser Thr Phe Leu Pro Pro Leu 

355 360 365 

Leu Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Ser Asp Arg Gly 

370 375 380 

Gly Val Asn Thr Val Thr Asn Trp Gin Thr Glu Ser Phe Glu Ser Thr 
385 390 395 400 

Leu Gly Leu Arg Cys Gly Ala Phe Thr Ala Arg Gly Asn Ser Asn Tyr 

405 410 415 

Phe Pro Asp Tyr Phe He Arg Asn He Ser Gly Val Pro Leu Val Val 

420 425 430 

Arg Asn Glu Asp Leu Arg Arg Pro Leu His Tyr Asn Glu He Arg Asn 

435 440 445 

He Glu Ser Pro Ser Gly Thr Pro Gly Gly Leu Arg Ala Tyr Met Val 
450 455 460 

30 Ser Val His Asn Arg Lys Asn Asn He Tyr Ala Val His Glu Asn Gly 
465 470 475 480 

Thr Met He His Leu Ala Pro Glu Asp Tyr Thr Gly Phe Thr He Ser 

485 490 495 

Pro He His Ala Thr Gin Val Asn Asn Gin Thr Arg Thr Phe He Ser 
35 5 0 0 5 0 5 5 1 0 

Glu Lys Phe Gly Asn Gin Gly- Asp Ser Leu Arg Phe Glu Gin Ser Asn 

515 520 525 

Thr Thr Ala Arg Tyr Thr Leu Arg Gly Asn Gly Asn Ser Tyr Asn Leu 
530 535 540 

40 Tyr Leu Arg Val Ser Ser Leu Gly Asn Ser Thr He Arg Val Thr He 
545 550 555 560 

Asn Gly Arg Val Tyr Thr Ala Ser Asn Val Asn Thr Thr Thr Asn Asn 

565 570 575 

Asp Gly Val Asn Asp Asn Gly Ala Arg Phe Leu Asp He Asn Met Gly 
45 580 585 590 

Asn Val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp He Asn Val 

595 600 605 

Thr Phe Asn Ser Gly Thr Gin Phe Glu Leu Met Asn He Met Phe Val 
610 615 620 

50 Pro Thr Asn Leu Pro Pro He Tyr 
625 630 



20 



25 



55 <210> 3 

<211> 729 
<212> DNA 
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<213> Bacillus thuringiensis 
<400> 3 

ttcgctagga accaagccat ttctagatta gaaggactaa gcaatcttta tcaaatttac 60 

5 gcagaatctt ttagagagtg ggaagcagat cctactaatc cagcattaag agaagagatg 120 

cgtattcaat tcaatgacat gaacagtgcc cttacaaccg ctattcctct tttggcagtt 180 

caaaattatc aagttcctct tttatcagta tatgttcaag ctgcaaattt acatttatca 240 

gttttgagag atgtttcagt gtttggacaa aggtggggat ttgatgccgc gactatcaat 300 

agtcgttata atgatttaac taggcttatt ggcaactata cagattatgc tgtgcgctgg 360 

10 tacaatacgg gattagagcg tgtatgggga ccggattcta gagattgggt aaggtataat 420 

caatttagaa gagagctaac acttactgta ttagatatcg ttgctctatt ctcaaattat 480 

gatagtcgaa ggtatccaat tcgaacagtt tcccaattaa caagagaaat ttatacgaac 540 

ccagtattag aaaattttga tggtagtttt cgtggaatgg ctcagagaat agaacagaat 600 

attaggcaac cacatcttat ggatatcctt aatagtataa ccatttatac tgatgtgcat 660 

15 agaggcttta attattggtc agggcatcaa ataacagctt ctcctgtagg gttttcagga 720 

ccagaattc 729 

<210> 4 

20 <211> 243 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 4 

25 Phe Ala Arg Asn Gin Ala lie Ser Arg Leu Glu Gly Leu Ser Asn Leu 
15 10 15 

Tyr Gin lie Tyr Ala Glu Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr 
20 25 30 



30 



45 



Asn Pro Ala Leu Arg Glu Glu Met Arg lie Gin Phe Asn Asp Met Asn 
35 40 45 



Ser Ala Leu Thr Thr Ala lie Pro Leu Leu Ala Val Gin Asn Tyr Gin 
35 50 55 60 

Val Pro Leu Leu Ser Val Tyr Val Gin Ala Ala Asn Leu His Leu Ser 
65 70 75 80 

40 Val Leu Arg Asp Val Ser Val Phe Gly Gin Arg Trp Gly Phe Asp Ala 

85 90 95 

Ala Thr lie Asn Ser Arg Tyr Asn Asp Leu Thr Arg Leu lie Gly Asn 
100 105 110 



Tyr Thr Asp Tyr Ala Val Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val 
115 120 125 



Trp Gly Pro Asp Ser Arg Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg 
50 130 135 140 

Glu Leu Thr Leu Thr Val Leu Asp lie Val Ala Leu Phe Ser Asn Tyr 
145 150 155 160 

55 Asp Ser Arg Arg Tyr Pro lie Arg Thr Val Ser Gin Leu Thr Arg Glu 

165 170 175 
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Tle Tvr Thr Asn Pro Val Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly 
^ 180 185 190 

Met Ala Gin Arg He Glu Gin Asn He Arg Gin Pro His Leu Met Asp 
195 200 205 

He Leu Asn Ser He Thr He Tyr Thr Asp Val His Arg Gly Phe Asn 



210 



215 220 



Tyr Trp Ser Gly His Gin He Thr Ala Ser Pro Val Gly Phe Ser Gly 



225 



Pro Glu Phe 



230 235 240 



<210> 5 
<211> 1959 
<212> DNA 

<213> Bacillus thuringiensis 

gaaaatgaga atgaaattat aaatgcctta tcgattccag ctgtatogaa tcattccgca 60 
caaatggatc tatcgctaga tgctcgtatt gaggattctt tgtgtatagc cgaggggaat 120 
aatatcaatc cacttgttag cgcatcaaca gtccaaacgg gtataaacat agctggtaga 180 
atattgggcg tattaggtgt gccgtttgct ggacaactag ctagttttta tagttttctt 240 
gttggggaat tatggcctag tggtagagat ccatgggaaa ttttcctgga atatgtagaa 300 
caacttataa gacaacaagt aacagaaaat actaggaata cggctattgc tcgattagaa 360 
ggtctaggaa gaggctatag atcttaccag caggctcttg aaacttggtt agataaccga 420 
aatgatgcaa gatcaagaag cattattctt gagcgctatg ttgctttaga acttgacatt 480 
actactgcta taccgctttt cagaatacga aatgaagaag ttccattatt aatggtatat 540 
gctcaagctg caaatttaca cctattatta ttgagagacg catccotttt tggtagtgaa 600 
tgggggatgg catcttccga tgttaaccaa tattaccagg aacaaatcag atatacagag 660 
gaatattcta accattgcgt acaatggtat aatacagggc taaataactt aagagggaca 720 
aatgctgaaa gttggttgcg gtataatcaa ttccgtagag acctaacgtt aggggtatta 780 
gatttagtag ccctattccc aagctatgat actcgcactt atccaatcaa tacgagtgct 840 
cagttaacaa gagaaattta tacagatcca attgggagaa caaatgcacc ttcaggattt 900 
gcaagtacga attggtttaa taataatgca ccatcgtttt ctgccataga ggctgccatt 960 
ttcaggcctc cgcatctact tgattttcca gaacaactta caatttacag tgcatcaagc 1020 
cgttggagta gcactcaaca tatgaattat tgggtgggac ataggcttaa cttccgccca 1080 
ataggaggga cattaaatac ctcaacacaa ggacttacta ataatacttc aattaatcct 1140 
gtaacattac attacgtttc gtctcgtgac gtttatagaa cagaatcaaa tgcagggaca 1200 
aatatactat ttactactcc tgtgaatgga gtaccttggg ctagatttaa ttttataacc 1260 
ctcagaatat ttatgaaaga ggcgccacta cctacagtca accgtatcag ggagttggga 1320 
ttcaattatt tgattcagaa actgaattac caccagaaac aacagaacga ccaaattatg 1380 
aatcatatag tcatagatat ctcatataga ctaatcatag gaaacacttt gagagcacca 1440 
gtctattctt ggacgcatcg tagtgcagat cgtacgaata cgattggacc aaatagaatt 1500 
actcaaattc ctgcagtgaa gggaagattt ctttttaatg gttctgtgat ttcaggacca 1560 
ggatttactg gtggagacgt agttagattg aataggaata atggtaatat ccaaaataga 1620 
gggtatattg aagttccaat tcaattcacg tcgacatcta ccagatatcg agttcgagta 1680 
cgttatgctt ctgtaacctc gattgagctc aatgttaatt tgggcaattc atcaattttt 1740 
acgaacacat taccagcaac agctgcatca ttagataatc tacaatcagg ggattttggt 1800 
tatgttgaaa tcaacaatgc ttttacatcc gcaacaggta atatagtagg tgctagaaat 1860 
tttagtgcaa atgcagaagt aataatagac agatttgaat ttatcccagt tactgcaacc 1920 
ttcgaggtag aatatgattt agaaagagca caaaaggcg 1959 
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<210> 6 
<211> 653 
<212> PRT 

<213> Bacillus thuringiensis 

5 

<400> S 

Glu Asn Glu Asn Glu lie lie Asn Ala Leu Ser lie Pro Ala Val Ser 
15 10 15 

10 Asn His Ser Ala Gin Met Asp Leu Ser Leu Asp Ala Arg lie Glu Asp 

20 25 30 

Ser Leu Cys lie Ala Glu Gly Asn Asn lie Asn Pro Leu Val Ser Ala 
35 40 45 

15 

Ser Thr Val Gin Thr Gly He Asn He Ala Gly Arg He Leu Gly Val 
50 55 60 

Leu Gly Val Pro Phe Ala Gly Gin Leu Ala Ser Phe Tyr Ser Phe Leu 
20 65 70 75 80 

Val Gly Glu Leu Trp Pro Ser Gly Arg Asp Pro Trp Glu He Phe Leu 
85 90 95 

25 Glu Tyr Val Glu Gin Leu He Arg Gin Gin Val Thr Glu Asn Thr Arg 
100 105 110 

Asn Thr Ala He Ala Arg Leu Glu Gly Leu Gly Arg Gly Tyr Arg Ser 
115 120 125 

30 

Tyr Gin Gin Ala Leu Glu Thr Trp Leu Asp Asn Arg Asn Asp Ala Arg 
130 135 140 

Ser Arg Ser He He Leu Glu Arg Tyr Val Ala Leu Glu Leu Asp He 
35 145 150 155 160 

Thr Thr Ala He Pro Leu Phe Arg He Arg Asn Glu Glu Val Pro Leu 
165 170 175 

40 Leu Met Val Tyr Ala Gin Ala Ala Asn Leu His Leu Leu Leu Leu Arg 
180 185 190 

Asp Ala Ser Leu Phe Gly Ser Glu Trp Gly Met Ala Ser Ser Asp Val 
195 200 205 

45 

Asn Gin Tyr Tyr Gin Glu Gin He Arg Tyr Thr Glu Glu Tyr Ser Asn 
210 215 220 

His Cys Val Gin Trp Tyr Asn Thr Gly Leu Asn Asn Leu Arg Gly Thr 
50 225 230 235 240 

Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gin Phe Arg Arg Asp Leu Thr 
245 250 255 



55 Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro Ser Tyr Asp Thr Arg 
260 265 270 
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Thr Tyr Pro He Asn Thr Ser Ala Gin Leu Thr Arg Glu He Tyr Thr 
275 280 285 

Asp Pro He Gly Arg Thr Asn Ala Pro Ser Gly Phe Ala Ser Thr Asn 
290 295 300 

Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala He Glu Ala Ala He 
305 310 315 320 

I Phe Arg Pro Pro His Leu Leu Asp Phe Pro Glu Gin Leu Thr He Tyr 
325 330 335 

Ser Ala Ser Ser Arg Trp Ser Ser Thr Gin His Met Asn Tyr Trp Val 
340 345 350 

Gly His Arg Leu Asn Phe Arg Pro He Gly Gly Thr Leu Asn Thr Ser 
355 360 365 

Thr Gin Gly Leu Thr Asn Asn Thr Ser He Asn Pro Val Thr Leu His 
J 370 375 380 

Tvr Val ser Ser Arg Asp Val Tyr Arg Thr Glu Ser Asn Ala Gly Thr 
385 390 395 400 

5 Asn He Leu Phe Thr Thr Pro Val Asn Gly Val Pro Trp Ala Arg Phe 

405 410 415 

Asn Phe He Thr Leu Arg He Phe Met Lys Glu Ala Pro Leu Pro Thr 
420 425 430 

^ val Asn Arg He Arg Glu Leu Gly Phe Asn Tyr Leu He Gin Lys Leu 
435 440 445 

Asn Tyr His Gin Lys Gin Gin Asn Asp Gin He Met Asn His He Val 
5 450 455 460 

He Asp He Ser Tyr Arg Leu He He Gly Asn Thr Leu Arg Ala Pro 
465 470 475 480 

40 Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn Thr He Gly 

485 490 495 

Pro Asn Arg He Thr Gin He Pro Ala Val Lys Gly Arg Phe Leu Phe 
500 505 510 

45 

Asn Gly Ser Val He Ser Gly Pro Gly Phe Thr Gly Gly Asp Val Val 
515 520 525 

Arg Leu Asn Arg Asn Asn Gly Asn He Gin Asn Arg Gly Tyr He Glu 
50 530 535 540 

Val Pro He Gin Phe Thr Ser Thr Ser Thr Arg Tyr Arg Val Arg Val 
545 550 555 560 

55 Arg Tyr Ala Ser Val Thr Ser He Glu Leu Asn Val Asn Leu Gly Asn 

565 570 575 
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Ser Ser lie Phe Thr Asn Thr Leu Pro Ala Thr Ala Ala Ser Leu Asp 
580 585 590 

Asn Leu Gin Ser Gly Asp Phe Gly Tyr Val Glu lie Asn Asn Ala Phe 
5 595 600 605 

Thr Ser Ala Thr Gly Asn lie Val Gly Ala Arg Asn Phe Ser Ala Asn 
610 615 620 

10 Ala Glu Val He lie Asp Arg Phe Glu Phe He Pro Val Thr Ala Thr 
625 630 635 640 

Phe Glu Val Glu Tyr Asp Leu Glu Arg Ala Gin Lys Ala 
645 650 

15 

<210> 7 
<211> 328 
<212> DNA 
20 <213> Bacillus thuringiensis 

<400> 7 

ctttacagga agattaccac aaagttatta tatcgtttcc gttatgcttc gggagcaaat 60 
aggagtggtt cattaagtta ttcacagcaa acttcgtatg taatttcatt tccaaaaact 120 
25 atggacgcag gtgaaccact aaicatctcgt tcgttcgctt ttacaacaac cgtcactcca 180 
atagccttta cacgagctca agaagaattt gatttataca tccaacagaa tgtttatata 240 
gatagagttg aatttatccc agtagatgca acatttgagg caaaatctga tttagaaaga 300 
gcgaaaaagg cggtgaatgc cttgttta 328 

30 

<210> 8 
<211> 109 
<212> PRT 

c213> Bacillus thuringiensis 

35 

<4Q0> 8 

Leu Tyr Arg Lys lie Thr Thr Lys Leu Leu Tyr Arg Phe Arg Tyr Ala 
15 10 15 

40 Ser Gly Ala Asn Arg Ser Gly Ser Leu Ser Tyr Ser Gin Gin Thr Ser 
20 25 30 

Tyr Val He Ser Phe Pro Lys Thr Met Asp Ala Gly Glu Pro Z*eu Thr 
35 40 45 

45 

Ser Arg Ser Phe Ala Phe Thr Thr Thr Val Thr Pro He Ala Phe Thr 
50 55 60 

Arg Ala Gin Glu Glu Phe Asp Leu Tyr He Gin Gin Asn Val Tyr He 
50 65 70 75 80 

Asp Arg Val Glu Phe He Pro Val Asp Ala Thr Phe Glu Ala Lys Ser 
85 90 95 

55 Asp Leu Glu Arg Ala Lys Lys Ala Val Asn Ala Leu Phe 
100 105 
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<210> 9 
<211> 340 
<212> DNA 
5 <213> Bacillus thuringiensis 

<400> 9 

ttacgagtaa cctttacagg aagattacca 
tcgggagcaa ataggagtgg ttcattaagt 
10 tttccaaaaa ctatggacgc aggtgaacca 
accgtcactc caataacctt tacacgagct 
aatgtttata tagatagagt tgaatttatc 
gatttagaaa gagcgaaaaa ggcggtgaat 

)l5 

<210> 10 
<211> 113 
<212> PRT 

<213> Bacillus thuringiensis 

20 

<400> 10 

Leu Arg Val Thr Phe Thr Giy Arg Leu Pro Gin Ser Tyr Tyr lie Arg 
15 10 15 

25 Phe Arg Tyr Ala Ser Gly Ala Asn Arg Ser Gly Ser Leu Ser Tyr Ser 
20 25 30 

Gin Gin Thr Ser Tyr Val lie Ser Phe Pro Lys Thr Met Asp Ala Gly 
35 40 45 

30 

Glu Pro Leu Thr Ser Arg Ser Phe Ala Phe Thr Thr Thr Val Thr Pro 
50 55 60 

lie Thr Phe Thr Arg Ala Gin Glu Glu Phe Asp Leu Tyr lie Gin Gin 
35 65 70 75 80 

Asn Val Tyr He Asp Arg Val Glu Phe He Pro Val Asp Ala Thr Phe 
85 90 95 

40 Glu Ala Lys Ser Asp Leu Glu Arg Ala Lys Lys Ala Val Asn Ala Leu 
100 105 110 

Phe 

45 

<210> 11 
<211> 306 
<212> DNA 
50 <213> Bacillus thuringiensis 

<400> 11 

gtatcgcgtg agatcgtatg ctctacgaca gatttacaat tctatacgaa tattaatgga 60 
actactatta atattggtaa tttctcgagc actatggaca gtggggatga tttacagtac 120 
55 ggaagattca gggttgcagg ttttactact ccatttacct tttcagatgc aaacagcaca 180 
ttcacaatag gtgcttttgg cttctctcca aacaacgaag tttatataga tcgaattgaa 240 
tttgtcccgg cagaagtaac atttgaggca gaatatgatt tagagaaagc tcagaaagcg 300 



caaagttatt atatacgttt ccgttatgct 60 
tattcacagc aaacttcgta tgtaatttca 120 
ctaacatctc gttcgttcgc ttttacaaca 180 
caagaagaat ttgatttata catccaacag 240 
ccagtagatg caacatttga ggcaaaatct 300 
gccttgttta 340 
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gtgaat 306 



<210> 12 
5 <211> 102 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 12 

10 Val Ser Arg Glu lie Val Cya Ser Thr Thr Asp Leu Gin Phe Tyx Thr 
15 10 15 

Asn lie Asn Gly Thr Thr lie Asn He Gly Asn Phe Ser Ser Thr Met 
20 25 30 



IS 



30 



Asp Ser Gly Asp Asp Leu Gin Tyr Gly Arg Phe Arg Val Ala Gly Phe 
35 40 45 



Thr Thr Pro Phe Thr Phe Ser Asp Ala Asn Ser Thr Phe Thr He Gly 
20 50 55 60 

Ala Phe Gly Phe Ser Pro Asn Asn Glu Val Tyr He Asp Arg He Glu 
65 70 75 80 

25 Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu Lys 

85 90 95 



Ala Gin Lys Ala Val Asn 
100 



<210> 13 
<211> 279 
<212> DNA 

35 <213> Bacillus thuringiensis r*^ 
<400> 13 

caattccata catcaattga cggaagacct attaatcagg ggaatttttc agcaactatg 60 
agtagtggga gtaatttaca gtccggaagc tttaggactg taggttttac tactccgttt 12 0 
40 aacttttcaa atggatcaag tgtatttacg ttaagtgctc atgtcttcaa ttcaggcaat 180 
gaagtttata tagatcgaat tgaatttatt ccggcagaag taacctttga ggcagaatat 240 
gatttagaaa gagcacaaaa ggcggtgaat gagctgttt 279 

45 <210> 14 
<211> 93 
<212> PRT 

<213> Bacillus thuringiensis 
50 <400> 14 

Gin Phe His Thr Ser He Asp Gly Arg Pro He Asn Gin Gly Asn Phe 
15 10 15 

Ser Ala Thr Met Ser Ser Gly Ser Asn Leu Gin Ser Gly Ser Phe Arg 
55 20 25 30 

Thr Val Gly Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val 
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35 40 

Phe Thr Leu Ser Ala His Val Phe 
50 55 

5 

Asp Arg lie 61u Phe lie Pro Ala 
65 70 

Asp Leu Glu Arg Ala Gin Lys Ala 
10 85 



45 

Asn Ser Gly Asn Glu Val Tyr lie 
60 

Glu Val Thr Phe Glu Ala Glu Tyr 
75 80 

Val Asn Glu Leu Phe 
90 



<210> 15 
<211> 397 
I 15 <212> DNA 

<213> Bacillus thuringiensis 

<400> 15 

aggaccaggt tttacaggtg ggatatcctt cgaagaacga atgttggtag ctttggagat 60 

20 atgcgtgtaa acattactgc accactatca caaagatatc gcgtaagaat tcgctatgct 120 

tctacgacag atttacaatt tttcacgaga atcaatggaa cttctgtaaa tcaaggtaat 180 

ttccaaagaa ctatgaatag agggggtaat ttagaatctg gaaactttag gactgcagga 240 

tttagtacgc cttttagttt tttcaaatgc gcaaagtaca ttcacattgg gtactcaggc 300 

ttttcaaatc aggaagttta tatagatcga attgaatttg tcccggcaga agtaacattc 360 

25 gaggcagaat ctgatttgga aagagcgcaa aaggcgg 397 



<210> 16 
<211> 132 
30 <212> PRT 

<213> Bacillus thuringiensis 

<400> 16 

Arg Thr Arg Phe Tyr Arg Trp Asp lie Leu Arg Arg Thr Asn Val Gly 
35 1 5 10 15 

Ser Phe Gly Asp Met Arg Val Asn lie Thr Ala Pro Leu Ser Gin Arg 
20 25 30 

40 Tyr Arg Val Arg lie Arg Tyr Ala Ser Thr Thr Asp Leu Gin Phe Phe 
35 40 45 

Thr Arg lie Asn Gly Thr Ser Val Asn Gin Gly Asn Phe Gin Arg Thr 
50 55 60 

45 

Met Asn Arg Gly Gly Asn Leu Glu Ser Gly Asn Phe Arg Thr Ala Gly 
65 70 75 80 

Phe Ser Thr Pro Phe Ser Phe Phe Lys Cys Ala Lys Tyr lie His lie 
50 85 90 95 

Gly Tyr Ser Gly Phe Ser Asn Gin Glu Val Tyr lie Asp Arg lie Glu 
100 105 110 

55 Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg 
115 120 125 
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Ala Gin Lys Ala 
130 



5 <210> 17 
<211> 123 
<212> DNA 

<213> Bacillus thuringiensis 
10 <400> 17 

ataatctaca atcaggggga ttttggttat gttgaaatca acaatgcttt tacatccgca 60 
acaggtaata tagtaggtgc tagaaatttt acgtgcaaat gcagaagtaa taatagacag 120 
att 



15 



20 



25 



30 



<210> 18 
<211> 41 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 18 

He He Tyr Asn Gin Gly Asp Phe Gly Tyr Val Glu He Asn Asn Ala 
15 10 15 

Phe Thr Ser Ala Thr Gly Asn He Val Gly Ala Arg Asn Phe Thr Cys 
20 25 30 

Lys Cys Arg Ser Asn Asn Arg Gin He 
35 40 



<210> 19 
<211> 192 
<212> DNA 

35 <213> Bacillus thuringiensis ^ 
<400> 19 

agttattata tacgtttccg ttatgcttcc gtagctaata ggagtggtat atttagctat 60 
tcacagccaa cttcatatgg aatttccttt ccaaaaacta tggatgcaga tgaatcatta 120 
40 acatctcgtt catttgcact tgctacactt gctacaccgc taacctttag aaggcaagaa 180 
gaattaaatc ta 



<210> 20 

45 <211> 64 

<212> PRT 

<213> Bacillus thuringiensis 



50 



55 



<400> 20 

Ser Tyr Tyr He Arg Phe Arg Tyr Ala Ser Val Ala Asn Arg Ser Gly 
15 10 15 

He Phe Ser Tyr Ser Gin Pro Thr Ser Tyr Gly He Ser Phe Pro Lys 
20 25 30 

Thr Met Asp Ala Asp Glu Ser Leu Thr Ser Arg Ser Phe Ala Leu Ala 
35 40 45 
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Thr Leu Ala Thr Pro Leu Thr Phe Arg Arg Gin Glu Glu Leu Asn Leu 
50 55 60 



<210> 21 
10 <211> 3507 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 
) 15 <221> CDS 

<222> (1) . . (3507) 

<400> 21 

atg gag ata aat aat cag aac caa tgc ata cca tat aat tgc tta agt 
20 Met Glu lie Asn Asn Gin Asn Gin Cya lie Pro Tyr Asn Cys Leu Ser 
15 10 15 

aat cct gag gaa gta ttt ttg gat ggg gag agg ata tta cct gat ate 
Asn Pro Glu Glu Val Phe Leu Asp Gly Glu Arg lie Leu Pro Asp He 
25 20 25 30 



30 



48 



96 



gat cca etc gaa gtt tct ttg teg ctt ttg caa ttt ctt ttg aat aac 144 
Asp Pro Leu Glu Val Ser Leu Ser Leu Leu Gin Phe Leu Leu Asn Asn 
35 40 45 

ttt gtt cca ggg ggg ggg ttt att tea gga tta ctt gat aaa ata tgg 192 
Phe Val Pro Gly Gly Gly Phe lie Ser Gly Leu Leu Asp Lys lie Trp 
50 55 60 

35 ggg get ttg aga cca tct gat tgg gaa tta ttt ctt gca cag att gaa 240 
' Gly Ala Leu Arg Pro Ser Asp Trp Glu Leu Phe Leu Ala Gin He Glu 
65 70 75 80 

cag ttg att gat ega aga ata gaa aga aca gta aga gca aaa gca ate 288 
40 Gin Leu He Asp Arg Arg lie Glu Arg Thr Val Arg Ala Lys Ala He 

85 90 95 

get gaa tta gaa ggt tta ggg aga agt tat caa eta tat gga gag gca 336 
Ala Glu Leu Glu Gly Leu Gly Arg Ser Tyr Gin Leu Tyr Gly Glu Ala 
45 100 105 110 

ttt aaa gag tgg gaa aaa act cca gat aac aca gcg get egg tct aga 384 
Phe Lys Glu Trp Glu Lys Thr Pro Asp Asn Thr Ala Ala Arg Ser Arg 
115 120 125 

50 

gta act gag aga ttt cgt ata att gat get caa att gaa gca aat ate 432 
. Val Thr Glu Arg Phe Arg He He Asp Ala Gin He Glu Ala Asn He 
130 135 140 

55 cct teg ttt egg gtt tec gga ttt gaa gtg cca ctt eta ttg gtt tat 480 
Pro Ser Phe Arg Val Ser Gly Phe Glu Val Pro Leu Leu Leu Val Tyr 
145 150 155 160 
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acc caa gca get aat ttg cat etc get eta tta aga gat tct gtt gtt 528 
Thr Gin Ala Ala Asn Leu His Leu Ala Leu Leu Arg Asp Ser Val Val 
165 170 175 



ttt gga gag aga tgg gga ttg aeg act aca aat gtc aat gat ate tat 
Phe Gly Glu Arg Trp Gly Leu Thr Thr Thr Asn Val Asn Asp He Tyr 
180 185 190 



576 



10 aat aga caa gtt aat aga att ggt gaa tat age aag cat tgt gta gat 624 
Asn Arg Gin Val Asn Arg He Gly Glu Tyr Ser Lys His Cys Val Asp 
195 200 205 

aeg tat aaa aca gaa tta gaa egt eta gga ttt aga tct ata gcg caa 672 
15 Thr Tyr Lys Thr Glu Leu Glu Arg Leu Gly Phe Arg Ser He Ala Gin 
210 215 220 



tgg aga ata tat aat cag ttt aga agg gaa ttg aca eta aeg gta tta 
Trp Arg He Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val Leu 
20 225 230 235 240 



720 



gat att gtc get gtt ttc ccg aac tat gat agt aga ctg tat ccg att 768 

Asp He Val Ala Val Phe Pro Asn Tyr Asp Ser Arg Leu Tyr Pro He 

245 250 255 

25 

cga aca att tct caa ttg aca aga gaa att tat aca tec cca gta age 816 

Arg Thr He Ser Gin Leu Thr Arg Glu He Tyr Thr Ser Pro Val Ser 

260 265 270 

30 gaa ttt tat tat ggt gtc att aat agt aat aat ata att ggt acc ctt 864 

Glu Phe Tyr Tyr Gly Val He Asn Ser Asn Asn He He Gly Thr Leu 

275 280 285 

act gaa cag caa ata agg cga cca cat ctt atg gae ttc ttt aac tec 912 

35 Thr Glu Gin Gin He Arg Arg Pro His Leu Met Asp Phe Phe Asn Ser ---^ 
290 295 3O0 

atg ate atg tat aeg tea gat aat aga cga gaa cat tat tgg tea gga 960 

Met He Met Tyr Thr Ser Asp Asn Arg Arg Glu His Tyr Trp Ser Gly 
40 305 310 315 320 

ctt gaa atg aeg get act aat act gag gga eat caa agg tea ttc cct 1008 
Leu Glu Met Thr Ala Thr Asn Thr Glu Gly His Gin Arg Ser Phe Pro 

325 330 335 

45 

tta get ggg act ata ggg aat tea get cca cca gta act gtt aga aat 

Leu Ala Gly Thr He Gly Asn Ser Ala Pro Pro Val Thr Val Arg Asn 

340 345 350 



1056 



50 aat ggt gag gga att tat aga ata tta teg gaa cca ttt tat tea gca 1104 

Asn Gly Glu Gly He Tyr Arg He Leu Ser Glu Pro Phe Tyr Ser Ala 
355 360 365 

cct ttt eta ggc aca agt gtg eta gga agt egt ggg gaa gaa ttt get 1152 

55 Pro Phe Leu Gly Thr Ser Val Leu Gly Ser Arg Gly Glu Glu Phe Ala 
370 375 380 
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ttt gca tct aat act act aca agt ctg cca tct aca ata tat aga aat 
Phe Ala Ser Asn Thr Thr Thr Ser Leu Pro Ser Thr He Tyr Arg Asn 
385 390 395 400 



1200 



5 cgt gga aca gta gat tea tta gtc age ata ccg cca cag gat tat age 1248 
Arg Gly Thr Val Asp Ser Leu Val Ser He Pro Pro Gin Asp Tyr Ser 
405 410 415 

gta cca ccg cac agg ggg tat agt cat tta tta agt cae gtt acg atg 1296 
10 Val Pro Pro His Arg Gly Tyr Ser His Leu Leu Ser His Val Thr Met 
420 425 430 

cgc aat agt tct cct ata ttc cac tgg aca cat cgt agt gca ace cct 1344 
Arg Asn Ser Ser Pro He Phe His Trp Thr His Arg Ser Ala Thr Pro 
; 15 435 440 445 



aga aat aca att gat cca gat agt ate act caa att cca gca gtt aag 

Arg Asn Thr He Asp Pro Asp Ser He Thr Gin He Pro Ala Val Lys 

450 45.5 460 

20 

gga geg tat att ttt aat agt cca gtc att act ggg cca gga cat aca 

Gly Ala Tyr He Phe Asn Ser Pro Val He Thr Gly Pro Gly His Thr 

465 470 475 480 

25 ggt ggg gat ata ata agg ttt aac cct aat act cag aac aae ata aga 
Gly Gly Asp He He Arg Phe Asn Pro Asn Thr Gin Asn Asn He Arg 

485 ^ 490 495 



30 



35 



40 



1392 



1440 



1488 



att cca ttt caa tea aat geg gta cag cgt tat cga att aga atg cgt 1536 
He Pro Phe Gin Ser Asn Ala Val Gin Arg Tyr Arg He Arg Met Arg 
500 505 510 

tat geg gca gaa get gat tgt att tta gaa agt gga gta aac att gtt 1584 
Tyr Ala Ala Glu Ala Asp Cys He Leu Glu Ser Gly Val Asn He Val 
515 520 525 

act ggg gca ggg gtc ace ttt agg cca att cct att aaa get aca atg 1632 
Thr Gly Ala Gly Val Thr Phe Arg Pro He Pro He Lys Ala Thr Met 
530 535 540 

act cct gga agt cct tta aca tat tac age ttc cag tat gca gat tta 
Thr Pro Gly Ser Pro Leu Thr Tyr Tyr Ser Phe Gin Tyr Ala Asp Leu 
545 550 555 560 



1680 



45 aat ata aat ctt act geg ccg ata aga cct aat aat ttt gta tct att 1728 

Asn He Asn Leu Thr Ala Pro He Arg Pro Asn Asn Phe Val Ser He 

565 570 575 

aga cgt tea aac caa cca gga aae ctt tat ata gat aga att gaa ttc 1776 

50 Arg Arg Ser Asn Gin Pro Gly Asn Leu Tyr He Asp Arg He Glu Phe 
580 585 590 

att cea att gac cca ate cgt gag gca gaa cat gat tta gaa aga geg 1824 

He Pro He Asp Pro He Arg Glu Ala Glu His Asp Leu Glu Arg Ala 
55 5 9 5 6 0 0 6 0 5 

caa aag geg gtg aat geg ctg ttt act tct tee aat caa eta gga tta 1872 
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Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn Gin Leu Gly Leu 
610 615 620 

aaa aca gat gtg acg gat tat cat att gat caa gtg tec aat tta gtt 1920 
5 Lys Thr Asp Val Thr Asp Tyr His lie Asp Gin Val Ser Asn Leu Val 
625 630 635 640 

gcg tgt tta teg gat gaa ttc tgc ctg gat gaa aag cga gaa ttg tec 1968 
Ala Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser 
10 645 650 655 

gag aaa gtt aaa cat gcg aag cga etc agt gat gag aga aat tta etc 2016 

Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu 

660 665 670 

IS 

caa gat caa aac ttt aca ggc ate aat agg caa gta gac cgt ggg tgg 2064 

Gin Asp Gin Asn Phe Thr Gly lie Asn Arg Gin Val Asp Arg Gly Trp 
675 680 685 

20 aga gga agt acg gat att acc ate caa gga ggg aat gat gta ttc aaa 2112 
Arg Gly Ser Thr Asp lie Thr lie Gin Gly Gly Asn Asp Val Phe Lys 
690 695 700 

gag aat tac gtc aca eta cca ggt acc ttt gat gag tgt tac eca acg 2160 
25 Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr 
705 710 715 720 

tat ttg tat caa aaa ata gat gag tea aaa tta aaa cct tat act cgc 2208 
Tyr Leu Tyr Gin Lys lie Asp Glu Ser Lys Leu Lys Pro Tyr Thr Arg 
30 725 730 735 

tat gaa tta aga ggg tat att gaa gat agt caa gac tta gaa gtc tat 2256 
Tyr Glu Leu Arg Gly Tyr lie Glu Asp Ser Gin Asp Leu Glu Val Tyr 

740 745 750 

35 

ttg ate cgt tac aat gca aaa cac gaa acg tta aat gtg cca ggt acg 2304 

Leu lie Arg Tyr Asn Ala Lys His Glu Thr Leu Asn Val Pro Gly Thr 
755 760 765 

40 ggt tec tta tgg cca ctt gca gee gaa agt tea ate ggg agg tgc ggc 2352 
Gly Ser Leu Trp Pro Leu Ala Ala Glu Ser Ser lie Gly Arg Cys Gly 
770 775 780 

gaa ccg aat cga tgc gcg cca cat att gaa tgg aat cct gaa eta gat 2400 
45 Glu Pro Asn Arg Cys Ala Pro His lie Glu Trp Asn Pro Glu Leu Asp 
785 790 795 800 

tgt teg tgt agg gat gga gaa aaa tgt gca cat cat tct cat cat ttc 2448 
Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe 
50 805 810 815 

tec ttg gat att gat gtt gga tgt aca gac tta aat gag gat tta ggt 2496 

Ser Leu Asp lie Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly 
820 825 830 

55 

gta tgg gtg ata ttt aag att aag acg caa gat ggc tat gca aga eta 2544 

Val Trp Val lie Phe Lys lie Lys Thr Gin Asp Gly Tyr Ala Arg Leu 
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835 840 845 



gga aat tta gag ttt etc gaa gag aaa cca ttg tta gga gaa gcg eta 2592 
Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu 
5 850 855 860 

get cgt gtg aag aga gcg gag aaa aaa tgg aga gae aaa cge gae aaa 2640 
Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Asp Lys 
865 870 875 880 

ttg gaa tgg gaa aca aat att gtt tat aaa gag gca aaa gaa tet gta 2688 
Leu Glu Trp Glu Thr Asn lie Val Tyr Lys Glu Ala Lys Glu Ser Val 
885 890 895 

) 15 gat get tta tte gta gat tet eaa tat aat aga tta caa aeg gat acg 2736 
Asp Ala Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Thr Asp Thr 
900 905 910 



10 



aac att gcg atg att cat gtg gca gat aaa cgc gtt cat cga ate cga 
20 Asn He Ala Met He His Val Ala Asp Lys Arg Val His Arg He Arg 
915 920 925 



cgt teg gtc ett gtt gtt ceg gaa tgg gaa gea gaa gtg tea eaa gaa 
40 Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu 
995 1000 1005 



55 ceg aat aac acg gta acg tgt aat gat tat cca gca aat caa gaa gaa 
Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Pro Ala Asn Gin Glu Glu 
1060 1065 1070 



2784 



gaa geg tat ttg cca gag tta tet gtg att ceg ggt gtc aat gcg get 2832 
Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala 
25 930 935 940 

att tte gaa gaa tta gaa ggt ctt att tte act gca tte tec eta tat 2880 
He Phe Glu Glu Leu Glu Gly Leu He Phe Thr Ala Phe Ser Leu Tyr 
945 950 955 960 

30 

gat geg aga aat gtc att aaa aac gga gat tte aat eat ggt tta tea 2928 
Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn His Gly Leu Ser 
965 970 975 

• 35 tgc tgg aac gtg aaa ggg cat gta gat gta gaa gaa caa aat aac cac 2976 
' Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn Asn His 
980 985 990 



3024 



gtc cgc gta tgt cca gga cgt ggc tat ate ctg cgt gtt aca gcg tac 3072 

Val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr 
45 1010 1015 1020 

aaa gag ggc tac gga gaa gga tgc gta acg ate cat gaa att gaa gat 3120 

Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asp 
1025 1030 1035 1040 

50 

eat aca gae gaa ctg aaa ttt aga aac tgt gaa gaa gag gaa gtg tat 3168 

His Thr Asp Glu Leu Lys Phe Arg Asn Cys Glu Glu Glu Glu Val Tyr 
1045 1050 1055 



3216 
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gaa gga aca ttc ate gta gac agt gtg gaa tta etc etc atg gag gaa 
Glu Gly Thr Phe lie val Asp Ser Val Glu Leu Leu Leu Met Glu Giu 
1155 1160 1165 



25 

tag 



tac agg get gcg gaa act tec egt aat cgt gga tat ggc gaa tct tat 3264 
Tyr Arg Ala Ala Glu Thr Ser Arg Asn Arg Gly Tyr Gly Glu Ser Tyr 
1075 lOBO 1085 

5 

gaa agt aat tct tec ata oca get gag tat gcg cca att tat gag aaa 3312 
Glu Ser Asn Ser Ser He Pro Ala Glu Tyr Ala Pro He Tyr Glu Lys 
1090 1095 1100 

10 gca tat aca gat gga aga aaa gag aat tct tgt gaa tct aac aga gga 3360 
Ala Tyr Thr Asp Gly Arg Lys Glu Asn Ser Cys Glu Ser Asn Arg Gly 
1105 1110 1115 1120 

tat gga aat tac aca ccg tta oca gca ggt tat gtg aca aaa gaa tta 3408 
15 Tyr Gly Asn Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu 
1125 1130 1135 

gag tac ttc cca gaa acc gat aag gta tgg ata gag att gga gaa acg 3456 
Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr 
20 1140 1145 1150 



3504 



3507 



30 


<210> 22 






















<211> 1168 






















<212> PRT 






















<213> Bacillus thuringiensis 










35 


<400> 22 
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40 


Asp 


Pro 


Leu 
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35 










40 


45 
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Phe 
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He 
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50 










55 




60 










Gly 
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Trp Glu Leu Phe 
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Gin 


He 
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45 


65 










70 




75 








80 
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He 
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He 
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He 












85 






90 
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100 
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110 






SO 


Phe 


Lys 


Glu 


Trp 
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115 
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55 


145 
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185 190 
El Arg lie Gly Glu Tyr Ser Lys His Cys Val Asp 

200 205 
u Leu Glu Arg Leu Gly Phe Arg Ser lie Ala Gin 

215 220 
El Gin Phe Arg Arg Glu Leu Thr Leu Thr Val Leu 
230 235 240 

1 Phe Pro Asn Tyr Asp Ser Arg Leu Tyr Pro lie 
10 245 250 255 

a Leu Thr Arg Glu lie Tyr Thr Ser Pro Val Ser 

265 270 
/ Val lie Asn Ser Asn Asn lie lie Gly Thr Leu 

280 285 
a Arg Arg Pro His Leu Met Asp Phe Phe Asn Ser 

295 300 
r Ser Asp Asn Arg Arg Glu His Tyr Trp Ser Gly 
310 315 320 

a Thr Asn Thr Glu Gly His Gin Arg Ser Phe Pro 
20 325 330 335 

e Gly Asn Ser Ala Pro Pro Val Thr Val Arg Asn 

345 350 
e Tyr Arg lie Leu Ser Glu Pro Phe Tyr Ser Ala 

360 365 
r Ser Val Leu Gly Ser Arg Gly Glu Glu Phe Ala 

375 380 
r Thr Thr Ser Leu Pro Ser Thr lie Tyr Arg Asn 
390 395 400 

p Ser Leu Val Ser lie Pro Pro Gin Asp Tyr Ser 
30 405 410 415 

J Gly Tyr Ser His Leu Leu Ser His Val Thr Met 

425 430 
3 lie Phe His Trp Thr His Arg Ser Ala Thr Pro 

440 445 
p Pro Asp Ser He Thr Gin He Pro Ala Val Lys 

455 460 
g Asn Ser Pro Val He Thr Gly Pro Gly His Thr 
470 475 480 

3 Arg Phe Asn Pro Asn Thr Gin Asn Asn He Arg 
40 485 490 495 

r Asn Ala Val Gin Arg Tyr Arg He Arg Met Arg 

505 510 
a Asp Cys He Leu Glu Ser Gly Val Asn He Val 

520 525 
L Thr Phe Arg Pro He Pro He Lys Ala Thr Met 

535 540 
D Leu Thr Tyr Tyr Ser Phe Gin Tyr Ala Asp Leu 
550 555 560 

r Ala Pro He Arg Pro Asn Asn Phe Val Ser He 
50 565 570 575 

1 Pro Gly Asn Leu Tyr He Asp Arg He Glu Phe . 
585 590 

0 He Arg Glu Ala Glu His Asp Leu Glu Arg Ala 
600 605 

1 Ala Leu Phe Thr Ser Ser Asn Gin Leu Gly Leu 
615 620 
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625 630 635 640 

Ala Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser 

645 650 655 

Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu 
5 660 665 670 

Gin Asp Gin Asn Phe Thr Gly lie Asn Arg Gin Val Asp Arg Gly Trp 

675 680 685 

Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val Phe Lys 
690 695 700 

10 Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr 
705 710 715 720 

Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys Pro Tyr Thr Arg 

725 730 735 

Tyr Glu Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu Val Tyr 
15 740 745 750 

Leu He Arg Tyr Asn Ala Lys His Glu Thr Leu Asn Val Pro Gly Thr 

755 760 765 

Gly ser Leu Trp Pro Leu Ala Ala Glu Ser Ser He Gly Arg Cys Gly 
770 775 780 

20 Glu Pro Asn Arg Cys Ala Pro His He Glu Trp Asn Pro Glu Leu Asp 
785 790 795 800 

Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe 

805 810 815 

Ser Leu Asp He Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly 
25 820 825 830 

Val Trp Val He Phe Lys He Lys Thr Gin Asp Gly Tyr Ala Arg Leu 

835 840 845 

Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu 
850 855 860 

30 Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Asp Lys 
865 870 875 880 

Leu Glu Trp Glu Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ser Val 

885 890 895 

Asp Ala Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Thr Asp Thr 
35 900 905 910 

Asn He Ala Met He His Val Ala Asp Lys Arg Val His Arg He Arg 

915 920 925 

Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala 
930 935 940 

40 He Phe Glu Glu Leu Glu Gly Leu He Phe Thr Ala Phe Ser Leu Tyr 
945 950 955 960 

Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn His Gly Leu Ser 

965 970 975 

Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn Asn His 
45 980 985 990 

Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu 

995 1000 1005 

val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr 
1010 1015 1020 

50 Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asp 
1025 • 1030 1035 1040 

His Thr Asp Glu Leu Lys Phe Arg Asn Cys Glu Glu Glu Glu Val Tyr 

1045 1050 1055 

Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Pro Ala Asn Gin Glu Glu 
55 1060 1065 1070 

Tyr Arg Ala Ala Glu Thr Ser Arg Asn Arg Gly Tyr Gly Glu Ser Tyr 
1075 1080 1085 
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Glu Ser Asn Ser Ser He Pro Ala Glu Tyr Ala Pro He Tyr Glu Lys 

1090 1095 1100 

Ala Tyr Thr Asp Gly Arg Lys Glu Asn Ser Cys Glu Ser Asn Arg Gly 
1105 1110 1115 1120 

5 Tyr Gly Asn Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu 
1125 1130 1135 

Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr 

1140 1145 1150 

Glu Gly Thr Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
10 1155 1160 1165 



<210> 23 
' !5 <211> 348 
<212> DNA 

<213> Bacillus thuringiensis 

<400> 23 

20 aataatagag gtcatcttcc aattccaatc 
gttcgtgtac gttatgcttc tgcaaccccc 
tcgttttttt caggtacagt accagctacg 
aattttggtt actttgagac cgctaatact 
attagaaatt ttagtgcaaa tgcagatttg 

25 gatgcaacct ccgaggcaga acatgattta 



caattttctt cgcgttctac cagataCcga 60 
attcaagtca atgttcattg ggaaaatagc 120 
gctcagtcat tagataatct acaatcaaac 180 
atttcatctt cattagatgg tatagtaggt 240 
ataatagaca gatttgaatt tatcccagtg 300 
gaaagagcgc aaaaggcg 348 



<210> 24 
<211> 116 
30 <212> PRT 

<213> Bacillus thuringiensis 

<400> 24 

Asn Asn Arg Gly His Leu Pro He Pro He Gin Phe Ser Ser Arg Ser 
35 1 5 10 15 

Thr Arg Tyr Arg Val Arg Val Arg Tyr Ala Ser Ala Thr Pro He Gin 
20 25 30 

40 Val Asn Val His Trp Glu Asn Ser Ser Phe Phe Ser Gly Thr Val Pro 
35 40 45 

Ala Thr Ala Gin Ser Leu Asp Asn Leu Gin Ser Asn Asn Phe Gly Tyr 
50 55 60 

45 

Phe Glu Thr Ala Asn Thr He Ser Ser Ser Leu Asp Gly He Val Gly 
65 70 75 80 

He Arg Asn Phe Ser Ala Asn Ala Asp Leu He He Asp Arg Phe Glu 
50 8 5 9 0 9 5 

Phe He Pro Val Asp Ala Thr Ser Glu Ala Glu His Asp Leu Glu Arg 
100 105 110 



55 Ala Gin Lys Ala 
115 
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<210> 25 
<211> 186 
c212> DNA 
S <213> Bacillus thuringiensis 

<400> 25 

ccactaacat ctcgttcgtt cgctcataca 
gctcaagaag aatttgatct atacatccaa 
10 attccagtta ctgcaacatt tgaggcagaa 
aatgcc 



acactcttca ctccaataac cttttcacga 60 
tcgggtgttt atatagatcg aattgaattt 120 
tatgatttag aaagagcgca aagggcggtg 180 

186 



<210> 26 
15 <211> 62 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 2& 

20 Pro Leu Thr Ser Arg Ser Phe Ala His Thr Thr Leu Phe Thr Pro lie 
15 10 15 

Thr Phe Ser Arg Ala Gin Glu Glu Phe Asp Leu Tyr lie Gin Ser Gly 
20 . 25 30 



25 



Val Tyr lie Asp Arg lie Glu Phe lie Pro Val Thr Ala Thr Phe Glu 
35 40 45 



Ala Glu Tyr Asp Leu Glu Arg Ala Gin Arg Ala Val Asn Ala 
30 50 55 60 



<210> 27 
<211> 3471 
35 <212> DNA 

<213> Bacillus thuringiensis 



<400> 27 

atgaatcgaa ataatcaaaa tgaatatgaa 

40 tcagatgacg atgtgaggta tcctttggca 

aactataaag attacttaca aatgacagat 

agtttatcta ttagtggtag agatgcagtt 

ctcggggctt taggtgttcc gttttctgga 

aatacactgt ggccagttaa tgatacagct 

45 gaacttgtca atcaacaaat aacagaattt 

ggattaggag actcttttaa tgtatatcaa 

aatgatacac gaaatttaag tgttgttcgt 

gttaatgcta ttccattgtt tgcagtaaat 

gcacaagctg tgaatttaca tttgttatta 

50 tggggattca cacaggggga aatttccaca 

aagtacacta attactgtga aacttggtat 

aatactgaaa gttggttaag atatcatcaa 

gatgttgtgg cgctatttcc atattatgat 

cagcttacac gtgaggtata tacagatccg 

55 ctttgccgac gttggggtac taatccctat 

attcgcccac cacatctttt tgataggctg 

ccagtttcat ctaattttat ggattattgg 



attattgatg ccccccattg tgggtgtcca 60 
agtgacccaa atgcagcgtt acaaaatatg i20 
gaggactaca ctgattctta tataaatcct 180 
cagactgcgc ttactgttgt tgggagaata 240 
caaatagtga gtttttatca attcctttta 300 
atatgggaag ctttcatgcg acaggtggag 360 
gcaagaaatc aggcacttgc aagattgcaa 420 
cgttcccttc aaaattggtt ggctgatcga 480 
gctcaattta tagctttaga ccttgatttt 540 
ggacagcagg ttccattact gtcagtatat 600 
ttaaaagatg catctctttt tggagaagga 660 
tattatgacc gtcaattgga actaaccgct 720 
aatacaggtt tagatcgttt aagaggaaca 780 
ttccgtagag aaatgacttt agtggtatta 840 
gtacgacttt atccaacggg atcaaaccca 900 
attgtattta atccaccagc taatgttgga 960 
aatacttttt ctgagctcga aaatgccttc 1020 
aatagcttaa caatcagcag taatcgattt 1080 
tcaggacata cgttacgccg tagttatctg 1140 



wo 01/19859 



PCTAJSOO/25361 



-25- 



aacgattcag cagtacaaga agatagttat ggcctaatta caaccacaag agcaacaatt 1200 
aatcctggag ttgatggaac aaaccgcata gagtcaacgg cagtagattt tcgttctgca 1260 
ttgataggta tatatggcgt gaatagagct tcttttgtcc caggaggctt gtttaatggt 1320 
acgacttctc ctgctaatgg aggatgtaga gatctctatg atacaaatga tgaattacca 1380 
5 ccagatgaaa gtaccggaag ttctacccat agactatctc atgttacctt ttttagtttt 1440 
caaactaatc aggctggatc tatagctaat gcaggaagtg tacctactta tgtttggacc 1500 
cgtcgtgatg tggaccttaa taatacgatt accccaaata gaattacaca attaccattg 1560 
gtaaaggcat ctgcacctgt ttcgggtact acggtcttaa aaggtccagg atttacagga 1620 
gggggtatac tccgaagaac aactaatggc acatttggaa cgttaagagt aacagttaat 1680 

10 tcaccattaa cacaaagata tcgcgtaaga gttcgttttg cttcatcagg aaatttcagc 1740 
ataaggatac tgcgtggaaa tacctctata gcttatcaaa gatttgggag tacaatgaac 1800 
agaggacagg aactaactta cgaatcattt gtcacaagtg agttcactac taatcagagc 1860 
gatctgcctt ttacatttac acaagctcaa gaaaatttaa caatccttgc agaaggtgtt 1920 
agcaccggta gtgaatattt tatagataga attgaaatca tccctgtgaa cccggcacga 1980 
J 15 gaagcagaag aggatttaga agcagcgaag aaagcggtgg cgaacttgtt tacacgtaca 2040 
agggacggat tacaggtaaa tgtgacagat tatcaagtgg accaagcggc aaatttagtg 2100 
tcatgcttat ccgatgaaca atatgggcat gacaaaaaga tgttattgga agcggtaaga 2160 
gcggcaaaac gcctcagccg cgaacgcaac ttacttcaag atccagattt taatacaatc 2220 
aatagtacag aagagaatgg ctggaaggca agtaacggtg ttactattag cgagggcggt 2280 

20 ccattcttta aaggtcgtgc acttcagtta gcaagcgcaa gagaaaatta tccaacatac 2340 
atttatcaaa aagtagatgc atcggtgtta aagccttata cacgctatag actagatgga 2400 
tttgtgaaga gtagtcaaga tttagaaatt gatctcatcc accatcataa agtccatctt 2460 
gtaaaaaatg taccagataa tttagtatct gatacttact cagatggttc ttgcagcgga 2520 
atcaaccgtt gtgatgaaca gcatcaggta gatatgcagc tagatgcgga gcatcatcca 2580 

25 atggattgct gtgaagcggc tcaaacacat gagttttctt cctatattaa tacaggggat 2640 
ctaaatgcaa gtgtagatca gggcatttgg gttgtattaa aagtccgaac aacagatggg 2700 
tatgcgacgt taggaaatct tgaattggta gaggttgggc cattatcggg tgaatctcta 2760 
gaacgggaac aaagagataa tgcgaaatgg aatgcagagc taggaagaaa acgtgcagaa 2820 
atagatcgtg tgtatttagc tgcgaaacaa gcaattaatc atctgtttgt agactatcaa 288 0 

30 gatcaacaat taaatccaga aattgggcta gcagaaatta atgaagcttc aaatcttgta 2940 
gagtcaattt cgggtgtata tagtgataca ctattacaga ttcctgggat taactacgaa 3000 
atttacacag agttatccga tcgcttacaa caagcatcgt atctgtatac gtctagaaat 3060 
gcggtgcaaa atggagactt taacagtggt ctagatagtt ggaatacaac tatggatgca 3120 
tcggttcagc aagatggcaa tatgcatttc ttagttcttt cgcattggga tgcacaagtt 3180 

35 tcccaacaat tgagagtaaa tccgaattgt aagtatgtct tacgtgtgac agcaagaaaa 3240 
• gtaggaggcg gagatggata cgtcacaatc cgagatggcg ctcatcacca agaaactctt 3300 
acatttaatg catgtgacta cgatgtaaat ggtacgtatg tcaatgacaa ttcgtatata 3360 
acagaagaag tggtattcta cccagagaca aaacatatgt gggtagaggt gagtgaatcc 3420 
gaaggttcat tctatiataga cagtattgag tttattgaaa cacaagagta g 3471 

40 

<210> 28 
<211> 1156 
<212> PRT 
45 <213> Bacillus thuringiensis 

<400> 28 

Met Asn Arg Asn Asn Gin Asn Glu Tyr Glu lie lie Asp Ala Pro His 
15 10 15 

50 

Cys Gly Cys Pro Ser Asp Asp Asp Val Arg Tyr Pro Leu Ala Ser Asp 
20 25 30 

Pro Asn Ala Ala Leu Gin Asn Met Asn Tyr Lys Asp Tyr Leu Gin Met 
55 35 40 45 

Thr Asp Glu Asp Tyr Thr Asp Ser Tyr lie Asn Pro Ser Leu Ser lie 
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50 55 60 

Ser Gly Arg Asp Ala Val Gin Thr Ala Leu Thr Val Val Gly Arg lie 
65 70 75 80 

S 

Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gin lie Val Ser Phe Tyr 
85 90 95 

Gin Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala lie Trp 
10 100 105 110 

Glu Ala Phe Met Arg Gin Val Glu Glu Leu Val Asn Gin Gin lie Thr 
115 120 125 

IS Glu Phe Ala Arg Asn Gin Ala Leu Ala Arg Leu Gin Gly Leu Gly Asp 
130 135 140 

Ser Phe Asn Val Tyr Gin Arg Ser Leu Gin Asn Trp Leu Ala Asp Arg 
145 150 155 160 

20 

Asn Asp Thr Arg Asn Leu. Ser Val Val Arg Ala Gin Phe lie Ala Leu 
165 170 175 

Asp Leu Asp Phe Val Asn Ala lie Pro Leu Phe Ala Val Asn Gly Gin 
25 1 80 1 85 1 90 

Gin Val Pro Leu Leu Ser Val Tyr Ala Gin Ala Val Asn Leu His Leu 
195 200 205 

30 Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr 
210 215 220 

Gin Gly Glu lie Ser Thr Tyr Tyr Asp Arg Gin Leu Glu Leu Thr Ala 
225 230 235 240 

35 

Lys Tyr Thr Asn Tyr Cys Glu Thr Trp Tyr Asn Thr Gly Leu Asp Arg 
245 250 255 

Leu Arg Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gin Phe Arg 
40 260 265 270 

Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr 
275 280 285 

45 Tyr Asp Val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gin Leu Thr Arg 
290 295 300 

Glu Val Tyr Thr Asp Pro lie Val Phe Asn Pro Pro Ala Asn Val Gly 
305 310 315 1 320 

50 

Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu 
325 330 335 

Glu Asn Ala Phe He Arg Pro Pro His Leu Phe Asp Arg Leu Asn Ser 
55 340 345 350 

Leu Thr He Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp 
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355 360 365 

Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala 
370 375 380 

5 

Val Gin Glu Asp Ser Tyr Gly Leu lie Thr Thr Thr Arg Ala Thr lie 
385 390 395 400 

Asn Pro Gly Val Asp Gly Thr Asn Arg lie Glu Ser Thr Ala Val Asp 
10 405 410 415 

Phe Arg Ser Ala Leu He Gly He Tyr Gly Val Asn Arg Ala Ser Phe 
420 425 430 

15 Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly 
435 440 445 

Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser 
450 455 460 

20 

Thr Gly Ser Ser Thr His Arg Leu Ser His Val Thr Phe Phe Ser Phe 
465 470 475 480 

Gin Thr Asn Gin Ala Gly Ser He Ala Asn Ala Gly Ser Val Pro Thr 
25 485 490 495 

Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr He Thr Pro 
500 505 510 

30 Asn Arg He Thr Gin Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser 
515 520 525 

Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly He Leu 
530 535 540 

35 

Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn 
545 550 555 560 

Ser Pro Leu Thr Gin Arg Tyr Arg Val Arg Val Arg Phe Ala Ser Ser 
40 5 6 5 5 7 0 5 75 

Gly Asn Phe Ser He Arg He Leu Arg Gly Asn Thr Ser He Ala Tyr 
580 585 590 

45 Gin Arg Phe Gly Ser Thr Met Asn Arg Gly Gin Glu Leu Thr Tyr Glu 
595 600 605 

Ser Phe Val Thr Ser Glu Phe Thr Thr Asn Gin Ser Asp Leu Pro Phe 
610 615 620 

50 

Thr Phe Thr Gin Ala Gin Glu Asn Leu Thr He Leu Ala Glu Gly Val 
625 630 635 640 

Ser Thr Gly Ser Glu Tyr Phe He Asp Arg He Glu He He Pro Val 
55 6 4 5 6 5 0 6 5 5 

Asn Pro Ala Arg Glu 70.a Glu Glu Asp Leu Glu Ala Ala Lys Lys Ala 
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660 665 670 

Val Ala Asn Leu Phe Thr Arg Thr Arg Asp Gly Leu Gin Val Asn Val 
675 680 685 

5 

Thr Asp Tyr Gin Val Asp Gin Ala Ala Asn Leu Val Ser Cys Leu Ser 
690 695 700 

Asp Glu Gin Tyr Gly His Asp Lys Lys Met Leu Leu Glu Ala Val Arg 
10 705 710 715 720 

Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gin Asp Pro Asp 
725 730 735 

15 Phe Asn Thr lie Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn ' 
740 745 750 

Gly Val Thr lie Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala Leu 
755 760 765 

20 

Gin Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr lie Tyr Gin Lys 
770 775 780 

Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly 
25 785 790 795 800 

Phe Val Lys Ser Ser Gin Asp Leu Glu lie Asp Leu He His His His 
805 810 815 

30 Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp Thr 
820 825 830 

Tyr Ser Asp Gly Ser Cys Ser Gly He Asn Arg Cys Asp Glu Gin His 
835 840 845 

35 

Gin Val Asp Met Gin Leu Asp Ala Glu His His Pro Met Asp Cys Cys 
850 855 860 

Glu Ala Ala Gin Thr His Glu Phe Ser Ser Tyr He Asn Thr Gly Asp 
40 865 870 875 880 

Leu Asn Ala Ser Val Asp Gin Gly He Trp Val Val Leu Lys Val Arg 
885 890 895 

45 Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val 
900 905 910 

Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gin Arg Asp Asn Ala 
915 920 925 

50 

Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu He Asp Arg Val 
930 935 940 

Tyr Leu Ala Ala Lys Gin Ala He Asn His Leu Phe Val Asp Tyr Gin 
55 945 950 955 960 

Asp Gin Gin Leu Asn Pro Glu He Gly Leu Ala Glu He Asn Glu Ala 
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965 970 975 

Ser Asn Leu Val Glu Ser lie Ser Gly Val Tyr Ser Asp Thr Leu Leu 
980 985 990 

^ Gin lie Pro Gly He Asn Tyr Glu He Tyr Thr Glu Leu Ser Asp Arg 
995 1000 1005 

Leu Gin Gin Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gin Asn 
10 1010 1015 1020 

Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Thr Thr Met Asp Ala 
1025 1030 1035 1040 

i 15 Ser Val Gin Gin Asp Gly Asn Met His Phe Leu Val Leu Ser His Trp 

1045 1050 1055 

Asp Ala Gin Val Ser Gin Gin Leu Arg Val Asn Pro Asn Cys Lys Tyr 
1060 1065 1070 

20 

Val Leu Arg Val Thr Ala Arg Lys Val Gly Gly Gly Asp Gly Tyr Val 
1075 1080 1085 

Thr He Arg Asp Gly Ala His His Gin Glu Thr Leu Thr Phe Asn Ala 
25 1090 1095 1100 

Cys Asp Tyr Asp Val Asn Gly Thr Tyr Val Asn Asp Asn Ser Tyr He 
1105 1110 1115 1120 

30 Thr Glu Glu Val Val Phe Tyr Pro Glu Thr Lys His Met Trp Val Glu 
1125 1130 1135 

Val Ser Glu Ser Glu Gly Ser Phe Tyr He Asp Ser He Glu Phe He 
1140 1145 1150 

35 

' Glu Thr Gin Glu 
1155 



40 <210> 29 

<211> 2407 

<212> DNA 

<213> Bacillus thuringiensis 



45 <400> 29 

atgaatcaaa ataaacacgg aattattggc 
gttgcgaaat atcctttagc caacaatcca 
caaaatagta gtattctcaa ctggattaac 
tctattggga caaccatagt ctctcttatc 

50 atagtatatg accttatagg taaagtacta 
ttgtctatat gtgacttatt atctattatt 
gatgggattg cagattttaa tggttctgta 
gatagctgga ataagaatcc taattctgct 
atcgccgact cagaatttga tagaatttta 

55 ttagctagac aaaatgccca aatattatta 
catttattac tactaaggga tgctactaga 
acacctttta taaattatca atcaaaacta 



gcttccaatt gtggttgtgc atctgatgat 60 
tattcatctg ctttaaattt aaattcttgt 120 
ataataggcg atgcagcaaa agaagcagta 180 
acagcacctt ctcttactgg attaatttca 240 
ggaggtagta gtggacaatc catatcagat 300 
gatttacggg taagtcagag tgttttaaat 360 
ctcttataca ggaactattt agaggctctg 420 
tctgctgaag aactccgtac tcgttttaga 480 
acccgagggt ctttaacgaa tggtggctcg 540 
ttaccttctt ttgcgagcgc tgcatttttc 600 
tatggcacta attgggggct atacaatgct 660 
gtagagctta ttgaactata tactgattat 720 
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tgcgtacatt gggataatcg aggttcaacc 
gcttggttag aatttcatag atatcggaga 
gcatcatttt caagtcttga tattactaat 
agggtcattt atacagatcc aattggtttt 
s tggtttagct ttgttaatag agctaatttc 
agaccgtctt: ggtttttaaa taatatgatt 
agcccaagta ctgatagagc gagggtatgg 
aattcacaat ttattactga actaatctct 
ttagggcgaa atatatttag agtagattct 

10 ggagtgaata gggcggtatt ttatcatgat 
gaggggtata ttcgaacaac tgggatagat 
ttacctggag aaaattcaga tatcccaact 
acaataaatt taacaggagg acttagacaa 
atgtatggtt ggacacataa aagtctggct 

IS acacagatac cattgacgaa ggttgatacc 
ccaggattta taggaggagc tctacttcaa 
agggtccaat ttccacttca cttaagacaa 
acaacaaata ttcgattgag tgtgaatggc 
agtacaatga gattaggaga ggatttaaga 

20 acttctatta gacccactgc aagtccggac 
attagacaag aggtctatgt agatagaatt 
gcgaaagagg atctagaagc agcaaaaaaa 
gacggatt:ac aagtaaatgt: gaaagattat: 
tgcttatcag atgaacaata tgggtatgac 

25 gcaaaacgac ttagccgaga acgcaactta 
agtacagaag aaaatggatg gaaagcaagt 
ttctataaag gccgtgcaat tcagctagca 
tatcaaaaag tagatgcatc ggagttaaag 
gtgaaga 

30 



gaactaagac aacgagggcc tagtgctaca 780 
gagatgacat tgatgggatt agaaatagta 840 
tacccaatag aaacagattt tcagttgagt 900 
gtacatcgta gtagtcttag gggagaaagt 960 
tcagatttag aaaatgcaat acctaatcct 1020 
atatctactg gttcacttac attgccggtt 1080 
tatggaagtc gagatcgaat ttcccctgct 1140 
ggacaacata cgactgctac acaaactatt 1200 
caagcttgta atttaaatga taccacatat 1260 
gcgagtgaag gttctcaaag atccgtgtac 1320 
aaccctagag ttcaaaatat taacacttat 1380 
ccagaagact atactcatat attaagcaca 1440 
gtagcatcta atcgccgttc atctttagta 1500 
cgtaacaata ccattaatcc agatagaatt 1560 
cgaggcacag gtgtttctta tgtgaatgat 162 0 
aggactgacc atggttcgct tggagtattg 1680 
caatatcgta ttagagtccg ttatgcttct 1740 
agtttcggta ctatttctca aaatctccct 1800 
tacggatctt ttgctataag agagtttaat 1860 
caaattcgat tgacaataga accatctttt 1920 
gagttcattc cagttaatcc gacgcgagag 1980 
gcggtggcga gcttgtttac acgcacaagg 2040 
'caagtcgatc aagcggcaaa tttagtgtca 2100 
aaaaagatgt tattggaagc ggtacgtgcg 2160 
cttcaggatc cagattttaa tacaatcaat 2220 
aacggcgtta ctattagtga gggcgggcca 2280 
agtgcacgag aaaattaccc aacatacatc 2340 
ccgtatacac gttatagact ggatgggttc 2400 

2407 



<210> 30 
<211> 802 
<212> PRT 
35 <213> Bacillus thuringiensis 

<400> 30 

Met Asn Gin Asn Lys His Gly lie lie Gly Ala Ser Asn Cys Gly Cys 
15 10 15 



40 



Ala Ser Asp Asp Val Ala Lys Tyr Pro Leu Ala Asn Asn Pro Tyr Ser 
20 25 30 



Ser Ala Leu Asn Leu Asn Ser Cys Gin Asn Ser Ser lie Leu Asn Trp 
45 35 40 45 

He Asn He He Gly Asp Ala Ala Lys Glu Ala Val Ser He Gly Thr 
50 55 60 

50 Thr He Val Ser Leu He Thr Ala Pro Ser Leu Thr Gly Leu He Ser 
65 70 75 80 



55 



He Val Tyr Asp Leu He Gly Lys Val Leu Gly Gly Ser Ser Gly Gin 
85 90 95 

Ser He Ser Asp Leu Ser He Cys Asp Leu Leu Ser He He Asp Leu 
100 105 110 
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Arg Val Ser Gin Ser Val Leu Asn Asp Gly lie Ala Asp Phe Asn Gly 
115 120 125 

5 Ser Val Leu Leu Tyr Arg Asn Tyr Leu Glu Ala Leu Asp Ser Trp Asn 
130 135 140 

Lvs Asn Pro Asn Ser Ala Ser Ala Glu Glu Leu Arg Thr Arg Phe Arg 
145 150 155 160 

He Ala ASP Ser Glu Phe Asp Arg He Leu Thr Arg Gly Ser Leu Thr 
165 170 175 

Asn Gly Gly Ser Leu Ala Arg Gin Asn Ala Gin He Leu Leu Leu Pro 
J 15 180 185 190 

Ser Phe Ala Ser Ala Ala Phe Phe His Leu Leu Leu Leu Arg Asp Ala 
195 200 205 

20 Thr Arg Tyr Gly Thr Asn Trp Gly Leu Tyr Asn Ala Thr Pro Phe He 
210 215 220 

Asn Tyr Gin Ser Lys Leu Val Glu Leu He Glu Leu Tyr Thr Asp Tyr 
225 230 235 240 

25 

Cys Val His Trp Asp Asn Arg Gly Ser Thr Glu Leu Arg Gin Arg Gly 

245 250 255 

Pro ser Ala Thr Ala Trp Leu Glu Phe His Arg Tyr Arg Arg Glu Met 
30 2 6 0 2 6 5 2 70 

Thr Leu Met Gly Leu Glu He Val Ala Ser Phe Ser Ser Leu Asp He 
275 280 285 

35 Thr Asn Tyr Pro He Glu Thr Asp Phe Gin Leu Ser Arg Val He Tyr 
290 295 300 

Thr Asp Pro He Gly Phe Val His Arg Ser Ser Leu Arg Gly Glu Ser 
305 310 315 320 

40 

Trp Phe Ser Phe Val Asn Arg Ala Asn Phe Ser Asp Leu Glu Asn Ala 
325 330 335 

He Pro Asn Pro Arg Pro Ser Trp Phe Leu Asn Asn Met He He Ser 
45 340 345 350 

Thr Gly Ser Leu Thr Leu Pro Val Ser Pro Ser Thr Asp Arg Ala Arg 
355 360 365 

50 Val Trp Tyr Gly Ser Arg Asp Arg He Ser Pro Ala Asn Ser Gin Phe 
370 375 380 

He Thr Glu Leu He Ser Gly Gin His Thr Thr Ala Thr Gin Thr He 
385 390 395 400 

Leu Gly Arg Asn He Phe Arg Val Asp Ser Gin Ala Cys Asn Leu Asn 
405 410 415 
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Asp Thr Thr Tyr Gly Val Asn Arg Ala Val Phe Tyr His Asp Ala Ser 
420 425 430 

5 Glu Gly Ser Gin Arg Ser Val Tyr Glu Gly Tyr lie Arg Thr Thr Gly 
435 440 445 

lie Asp Asn Pro Arg Val Gin Asn lie Asn Thr Tyr Leu Pro Gly Glu 
450 455 460 

10 

Asn Ser Asp lie Pro Thr Pro Glu Asp Tyr Thr His lie Leu Ser Thr 
465 470 475 480 

Thr lie Asn Leu Thr Gly Gly Leu Arg Gin Val Ala Ser Asn Arg Arg 
15 485 490 495 

Ser Ser Leu Val Met Tyr Gly Trp Thr His Lys Ser Leu Ala Arg Asn 
500 505 510 

20 Asn Thr lie Asn Pro Asp Arg lie Thr Gin He Pro Leu Thr Lys Val 
515 520 525 

Asp Thr Arg Gly Thr Gly Val Ser Tyr Val Asn Asp Pro Gly Phe He 
530 535 540 

25 

Gly Gly Ala Leu Leu Gin Arg Thr Asp His Gly Ser Leu Gly Val Leu 

545 550 555 560 

Arg Val Gin Phe Pro Leu His Leu Arg Gin Gin Tyr Arg He Arg Val 
30 565 570 575 

Arg Tyr Ala Ser Thr Thr Asn He Arg Leu Ser Val Asn Gly Ser Phe 
580 585 590 

35 Gly Thr He Ser Gin Asn Leu Pro Ser Thr Met Arg Leu Gly Glu Asp 
595 600 605 

Leu Arg Tyr Gly Ser Phe Ala He Arg Glu Phe Asn Thr Ser He Arg 
610 615 620 

40 

Pro Thr Ala Ser Pro Asp Gin He Arg Leu Thr He Glu Pro Ser Phe 
625 630 635 640 

He Arg Gin Glu Val Tyr Val Asp Arg He Glu Phe He Pro Val Asn 
45 645 650 655 

Pro Thr Arg Glu Ala Lys Glu Asp Leu Glu Ala Ala Lys Lys Ala Val 

660 665 670 

50 Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gin Val Asn Val Lys 
675 680 685 

Asp Tyr Gin Val Asp Gin Ala Ala Asn Leu Val Ser Cys Leu Ser Asp 
690 695 700 

55 

Glu Gin Tyr Gly Tyr Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala 
705 710 715 720 



♦ 
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Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gin Asp Pro Asp Phe 
725 730 735 

5 Asn Thr He Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly 
740 745 750 

Val Thr He Ser Glu Gly Gly Pro Phe Tyr Lys Gly Arg Ala He Gin 
755 760 765 

Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr He Tyr Gin Lys Val 
770 775 780 

Asp Ala Ser Glu Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe 
/ 15 785 790 795 800 

Val Lys 



20 

<210> 31 
<211> 192 
<212> DNA 

<213> Bacillus thuringiensis 
<400> 31 

catttacgca acctcgtatg gatttcattt ccaagaacta tgggaacaga tgacccatta 60 
acttctcgtt cgtttgctct tacaactctt ttcacaccaa taaccttaac acgagcacaa 120 
gaagaattta atctaacaat accacggggt gtttatatag acagaattga attcgtccca 180 
30 gttatgccac at 



<210> 32 
<211> 64 
35 <212> PRT 

<213> Bacillus thuringiensis 



40 



45 



50 



<400> 32 

His Leu Arg Asn Leu Val Trp He Ser Phe Pro Arg Thr Met Gly Thr 
1 5 10 15 

Asp Asp Pro Leu Thr Ser Arg Ser Phe Ala Leu Thr Thr Leu Phe Thr 
20 25 30 

Pro He Thr Leu Thr Arg Ala Gin Glu Glu Phe Asn Leu Thr He Pro 
35 40 45 

Arg Gly Val Tyr He Asp Arg He Glu Phe Val Pro Val Met Pro His 
50 55 60 



55 <210> 33 
<211> 246 
<212> DNA 
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10 



15 



25 



<213> Bacillus thuringiensis 

<400> 33 

gcttctacta caaatttaca attccataca tcaattgacg gaagacctat taatcagggg 60 

aatttttcag caactatgag tagtgggggt aatttacagt ccggaagctt taggactgca 120 

ggctttacta ctccgtttaa cttttcaaat ggatcaagta tatttacgtt aagtgctcat 180 

gtcttcaatt caggcaatga agtttatata gatcgaattg aatttgttcc ggcagaagta 240 
acattt 246 



<210> 34 
<211> 82 
<212> PRT 

<213> Bacillus thuringiensis 
c400> 34 

Ala Ser Thr Thr Asn Leu Gin Phe His Thr Ser lie Asp Gly Arg Pro 
15 10 15 

20 He Asn Gin Gly Asn Phe Ser Ala Thr Met Ser Ser Gly Gly Asn Leu 
20 25 30 



Gin Ser Gly Ser Phe Arg Thr Ala Gly Phe Thr Thr Pro Phe Asn Phe 
35 40 45 

Ser Asn Gly Ser Ser lie Phe Thr Leu Ser Ala His Val Phe Asn Ser 
50 55 60 



Gly Asn Glu Val Tyr lie Asp Arg He Glu Phe Val Pro Ala Glu Val 
30 65 70 75 80 

Thr Phe 



35 

<210> 35 
<211> 177 
<212> DNA 

<213> Bacillus thuringiensis 

40 

<400> 35 

ctctttccag attatattca gcctcgagtg ttgcagtaac tggaataaat tcaaatctgt 60 
ctattatcac tcctgcagtc ccactaaaat ttctaacacc tactatatta cctaatgaag 120 
atgtaaaagc attggcactt caaaatcact tgattgtaga ttatctaatg acgtagc 177 



<210> 36 
<211> 57 
<212> PRT 
50 <213> Bacillus thuringiensis 



<400> 36 

Leu Ser Arg Leu Tyr Ser Ala Ser Ser Val Ala Val Thr Gly He Asn 
15 10 15 

Ser Asn Leu Ser lie He Thr Pro Ala Val Pro Leu Lys Phe Leu Thr 
20 25 30 
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Pro Thr He Leu Pro Asn Glu Asp Val Lys Ala Leu Ala Leu Gin Asn 
35 40 45 

5 His Leu He Val Asp Tyr Leu Met Thr 
SO 55 



<210> 37 
10 <211> 4173 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 
) 15 <221> CDS 

<222> (1) . . (3687) 

<400> 37 

ttg act tea aat agg aaa aat gag aat gaa att ata aat get tta teg 
20 Met Thr Ser Asn Arg Lys Asn Glu Asn Glu He He Asn Ala Leu Ser 
15 10 15 



25 



30 



35 



aga ata eta ggt gta tta gge gta eeg ttt get gga caa ata get agt 
Arg He Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gin He Ala Ser 
65 70 75 80 

ttt tat agt ttt ctt gtt ggt gaa tta tgg ccc cgc ggc aga gat cag 
Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Gin 
85 90 95 

tgg gaa att ttc eta gaa cat gtc gaa caa ctt ata aat caa caa ata 
Trp Glu He Phe Leu Glu His Val Glu Gin Leu He Asn Gin Gin He 
100 105 110 

aca gaa aat get agg aat acg gca ctt get ega tta caa ggt tta gga 
Thr Glu Asn Ala Arg Asn Thr Ala Leu Ala Arg Leu Gin Gly Leu Gly 
115 120 125 

gat tec ttt aga gcc tat caa cag tea ctt gaa gat tgg eta gaa aac 
Asp Ser Phe Arg Ala Tyr Gin Gin Ser Leu Glu Asp Trp I*eu Glu Asn 
130 135 140 

55 cgt gat gat gca aga acg aga agt gtt ctt tat ace caa tat ata gee 
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gin Tyr He Ala 
145 150 155 160 



40 



45 



50 



48 



96 



att cca get gta teg aat cat tec aca caa atg gat eta tea cea gat 
He Pro Ala Val Ser Asn His Ser Thr Gin Met Asp Leu Ser Pro Asp 
20 25 30 

get cgt att gag gat tet ttg tgt ata gcc gag ggg aat aat ate aat 144 
Ala Arg He Glu Asp Ser Leu Cys He Ala Glu Gly Asn Asn He Asn 
35 40 45 

cca ctt gtt age gca tea aca gtc caa acg ggt att aac ata get ggt 192 
Pro Leu Val Ser Ala Ser Thr Val Gin Thr Gly . He Asn He Ala Gly 
50 55 60 



240 



288 



336 



384 



432 



480 
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tta gaa ctt gat ttt ctt aat gcg atg ccg ctt ttc gca att aga aac 528 
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala lie Arg Asn 

165 170 175 

5 

caa gaa gtt cca tta tta atg gta tat get caa get gca aat tta cac 576 
Gin Glu Val Pro Leu Leu Met Val Tyr Ala Gin Ala Ala Asn Leu His 
180 185 190 

10 eta tta tta ttg aga gat gcc tct ctt ttt ggt agt gaa ttt ggg ctt 624 
Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu 
195 200 205 

aca teg cag gaa att caa cgt tat tat gag cgc caa gtg gaa caa acg 672 
15 Thr Ser Gin Glu lie Gin Arg Tyr Tyr Glu Arg Gin Val Glu Gin Thr • 
210 215 220 

aga gat tat tec gac tat tgc gta gaa tgg tat aat aca ggt eta aat 720 
Arg Asp Tyr Ser Asp Tyr Cys Val Glu Trp Tyr Asn Thr Gly Leu Asn 
20 225 230 235 240 

age ttg aga ggg aca aat gcc gca agt tgg gtg cgt tat aat caa ttc 768 
Ser Leu Arg Gly Thr Asn Ala Ala Ser Trp Val Arg Tyr Asn Gin Phe 
245 250 255 

25 

cgt aga gat eta acg tta ggg gta tta gat eta gtg gca eta ttc cca 816 
Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro 
260 265 270 

30 age tat gac act cgc act tat cca ata aat acg agt get cag tta aca 864 
Ser Tyr Asp Thr Arg Thr Tyr Pro lie Asn Thr Ser Ala Gin Leu Thr 
275 280 285 

agg gaa gtt tat aca gac gca att gga gca aca ggg gta aat atg gca 912 
35 Arg Glu Val Tyr Thr Asp Ala lie Gly Ala Thr Gly Val Asn Met Ala -^.^ 
290 295 300 

agt atg aat tgg tat aat aat aat gca cct teg ttt tec get ata gag 960 
Ser Met Asn Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Ala lie Glu 
40 305 310 315 320 

act gcg gtt ate cga age ccg cat eta ctt gat ttt eta gaa caa ctt 1008 

Thr Ala Val lie Arg Ser Pro His Leu Leu Asp Phe Leu Glu Gin Leu 
325 330 335 

45 

aca att ttt age act tea tea cga tgg agt get act agg cat atg act 1056 

Thr He Phe Ser Thr Ser Ser Arg Trp Ser Ala Thr Arg His Met Thr 
340 345 350 

50 tac tgg egg ggg cac aca att caa tct egg cca ata gga ggc gga tta 1104 
Tyr Trp Arg Gly His Thr He Gin Ser Arg Pro He Gly Gly Gly Leu 
355 360 365 

aat ace tea acg cat ggg tct ace aat act tct att aat cct gta aga 1152 
55 Asn Thr Ser Thr His Gly Ser Thr Asn Thr Ser lie Asn Pro Val Arg 
370 375 380 
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tta tea ttc ttc tct cga gac gta tat tgg act gaa tea tat gca gga 1200 
Leu Ser Phe Phe Ser Arg Asp Val Tyr Trp Thr Glu Ser Tyr Ala Gly 
385 390 395 400 

5 gtg ett eta tgg gga att tac ctt gaa cot att cat ggt gtc cct act 1248 
Val Leu Leu Trp Gly lie Tyr Leu Glu Pro lie His Gly Val Pro Thr 
405 410 415 

gtt aga ttt aat ttt agg aac cct cag aat act ttt gaa aga ggt act 1296 
10 Val Arg Phe Asn Phe Arg Asn Pro Gin Asn Thr Phe Glu Arg Gly Thr 
420 425 430 

get aac tat agt caa ecc tat gag tea cct ggg ctt caa tta aaa gat 1344 
Ala Asn Tyr Ser Gin Pro Tyr Glu Ser Pro Gly Leu Gin Leu Lys Asp 
15 435 440 445 

tea gaa act gaa tta eca cca gaa aca aca gaa ega eca aat tat gaa 1392 

Ser Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu 
450 455 460 

20 

tea tat agt cat agg tta tct cae ata ggg etc att tea caa tct agg 1440 

Ser Tyr Ser His Arg Leu Ser His lie Gly Leu lie Ser Gin Ser Arg 

465 470 475 480 



25 gtg eat gta eca gta tat tct tgg acg cae cgt agt gea gat cgt aca 
Val His Val Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr 
485 490 495 



1488 



aat ace att agt tea gat age ata aca caa ata cca ttg gta aaa tea 1536 

30 Asn Thr lie Ser Ser Asp Ser lie Thr Gin lie Pro Leu Val Lys Ser 

500 505 510 

ttc aac ctt aat tea ggt aec tct gta gtc agt ggc eca gga ttt aca 1584 

Phe Asn Leu Asn Ser Gly Thr Ser Val Val Ser Gly Pro Gly Phe Thr 

35 515 520 525 

gga ggg gat ata ate ega act aac gtt aat ggt agt gta eta agt atg 1632 

Gly Gly Asp lie lie Arg Thr Asn Val Asn Gly Ser Val Leu Ser Met 

530 535 540 

40 

ggt ctt aat ttt aat aat aca tea tta cag egg tat egc gtg aga gtt 1680 

Gly Leu Asn Phe Asn Asn Thr Ser Leu Gin Arg Tyr Arg Val Arg Val 

545 550 555 560 

45 cgt tat get get tct caa aca atg gtc ctg agg gta act gtc gga ggg 1728 

Arg Tyr Ala Ala Ser Gin Thr Met Val Leu Arg Val Thr Val Gly Gly 
565 570 575 

agt act act ttt gat caa gga ttc cct agt act atg agt gca aat gag 1776 

50 Ser Thr Thr Phe Asp Gin Gly Phe Pro Ser Thr Met Ser Ala Asn Glu 

580 585 590 

tct ttg aca tct caa tea ttt aga ttt gca gaa ttt cct gta ggt att 1824 

Ser Leu Thr Ser Gin Ser Phe Arg Phe Ala Glu Phe Pro Val Gly lie 

55 595 600 605 

agt gca tct ggc agt caa act get gga ata agt ata agt aat aat gea 1872 
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Ser Ala Ser Gly Ser Gin Thr Ala Gly lie Ser lie Ser Asn Asn Ala 
610 615 620 

ggt aga caa acg ttt cac ttt gat aaa att gaa ttc att cca att act 1920 
5 Gly Arg Gin Thr Phe His Phe Asp Lys lie Glu Phe He Pro He Thr 
625 630 635 640 

gca acc ttc gaa gca gaa tac gat tta gaa agg gcg caa gag gcg gtg 1968 
Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Glu Ala Val 
10 645 650 655 

aat get ctg ttt act aat acg aat cca aga aga ttg aaa aca gat gtg 2016 
Asn Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Asp Val 
660 665 670 

aca gat tat cat att gat caa gta tec aat tta gtg gcg tgt tta teg 2064 
Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Ala Cys Leu Ser 
675 680 685 

20 gat gaa ttc tgc tta gat gaa aag aga gaa tta ctt gag aaa gtg aaa 2112 
Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys 
690 695 700 



IS 



tat gcg aaa cga etc agt gat gaa aga aac tta etc caa gat cca aac 
25 Tyr Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
705 710 715 720 



2160 



ttc aca tec ate aat aag caa cca gac ttc ata tct act aat gag caa 2208 
Phe Thr Ser He Asn Lys Gin Pro Asp Phe He Ser Thr Asn Glu Gin 
30 725 730 735 

teg aat ttc aca tct ate cat gaa caa tct gaa cat gga tgg tgg gga 2256 
Ser Asn Phe Thr Ser He His Glu Gin Ser Glu His Gly Trp Trp Gly 
740 745 750 

35 

agt gag aac att aca ate cag gaa gga aat gac gta ttt aaa gag aat 2304 
Ser Glu Asn He Thr He Gin Glu Gly Asn Asp Val Phe Lys Glu Asn 
755 760 765 

40 tac gtc aca eta ceg ggg act ttt aat gag tgt tat ceg acg tat tta 2352 
Tyr Val Thr Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu 
770 775 780 

tat caa aaa ata gga gag teg gaa tta aaa get tat act cgc tac caa 2400 
45 Tyr Gin Lys He Gly Glu Ser Glu Leu Lys Ala Tyr Thr Arg Tyr Gin 
785 790 795 800 

tta aga ggg tat att gaa gat agt caa gat tta gag ata tat ttg att 2448 
Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He 
50 805 810 815 

cgt tat aat gcg aaa cat gaa aca ttg gat gtt cca ggt acc gag tee 2496 

Arg Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser 
820 825 830 

55 

gta tgg ceg ctt tea gtt gaa age cca ate gga agg tgc gga gaa ceg 2544 

Val Trp Pro Leu Ser Val Glu Ser Pro He Gly Arg Cys Gly Glu Pro 
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2592 



2640 



835 840 845 

aat cga tgc gca cca cat ttt gaa tgg aat cct gat eta gat tgt tec 
Asn Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser 
5 850 855 860 

tgc aga gat gga gaa aaa tgt gcg cat cat tec cat cat ttc tct ttg 
Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu 
865 870 875 880 

10 

gat att gat att gga tgc aca gac ttg cat gag aat eta ggc gtg tgg 2688 
Asp lie Asp lie Gly Cys Thr Asp Leu His Glu Asn Leu Gly Val Trp 
885 890 895 

15 gtg gta ttc aag att aag acg cag gaa ggt cat gca aga eta ggg aat 2736 
Val val Phe Lys lie Lys Thr Gin Glu Gly His Ala Arg Leu Gly Asn 
900 905 910 

ctg gaa ttt att gaa gag aaa cca tta tta gga gaa gca ctg tct cgt 2784 
20 Leu Glu Phe lie Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg 
915 920 925 

gtg aag aga gca gag aaa aaa tgg aga gac aaa cgt gaa aaa eta caa 2832 
Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin 
25 930 935 940 

ttg gaa aca aaa cga gta tat aca gag gca aaa gaa get gtg gat get 2880 

Leu Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala 
945 950 955 960 

30 

tta ttt gta gat tct caa tat aat aga tta caa gcg gat aca aac att 2 928 

Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Ala Asp Thr Asn lie 
965 970 975 

35 gge atg att cat gcg gca gat aaa ctt gtt cat cga att cga gag get 2976 
Gly Met lie His Ala Ala Asp Lys Leu Val His Arg lie Arg Glu Ala 
980 985 990 

tat ctg tea gaa tta tct gtt ate ccg ggt gta aat gcg gaa att ttt 3024 
40 Tyr Leu Ser Glu Leu Ser Val lie Pro Gly Val Asn Ala Glu lie Phe 
995 1000 1005 

gaa gaa tta gaa ggt egc att ate act gca ate tec eta tac gat gcg 3072 
Glu Glu Leu Glu Gly Arg lie lie Thr Ala lie Ser Leu Tyr Asp Ala 
45 1010 1015 1020 

aga aat gtc gtt aaa aat ggt gat ttt aat aat gga tta gca tgc tgg 3120 

Arg Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala Cys Trp 
1025 1030 1035 1040 

50 

aat gta aaa ggg cat gta gat gta caa cag age cat eac cgt tct gtc 3168 

Asn Val Lys Gly His Val Asp val Gin Gin Ser His His Arg Ser Val 
1045 1050 1055 

55 ctt gtt ate cca gaa tgg gaa gca gaa gtg tea caa gca gtt cgc gtc 3216 
Leu Val lie Pro Glu Trp Glu Ala Glu Val Ser Gin Ala Val Arg Val 
1060 1065 1070 
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tgt ccg ggg cgt ggc tat ate etc cgt gtc aca gcg tac aaa gag gga 
Cys Pro Gly Arg Gly Tyr lie Leu Arg Val Thr Ala Tyr Lys Glu Gly 
1075 1080 1085 

5 

tat gga gag ggt tgt gta acg ate cat gaa ate gag aac aat aca gac 
Tyr Gly Glu Gly Cys Val Thr lie His Glu He Glu Asn Asn Thr Asp 
1090 1095 1100 

10 gaa eta aaa ttt aaa aac tgt gaa gaa gag gaa gtg tat cca acg gat 
Glu Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp 
1105 1110 1115 1120 

aca gga acg tgt aat gat tat act gea cac caa ggt aca gca gea tgt 
15 Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Ala Cys 
1125 1130 1135 

aat tec cgt aat get gga tat gag gat gca tat gaa gtt gat act aca 
Asn Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr 
20 1140 1145 1150 

gca tct gtt aat tac aaa ccg act tat gaa gaa gaa acg tat aca gat 
Ala Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp 
1155 1160 1165 

gta cga aga gat aat eat tgt gaa tat gac aga ggg tat gtg aat tat 
Val Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr 
1170 1175 1180 

cca cca eta cca get ggt tat gtg aca aag gaa tta gaa tat ttc cca 
Pro Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro 
1185 1190 1195 1200 

gaa ace gat aag gta tgg att gag att gga gaa acg gaa gga aca ttc 
35 Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr Glu Gly Thr Phe 

1205 1210 1215 

ate gtg gac age ata gaa tta etc ctt atg gaa gaa tag gaccgteega 
He Val Asp Ser He Glu Leu Leu Leu Met Glu Glu 
40 1220 1225 



25 



30 



45 



gtatagcagt ttaataaatc ttaateaaaa tagtagtcta acttcegtta eaatttaata 
agtaaattae agttgtaaaa agaaaaegga catcaetect aagagagcga tgtccgtttt 
ctatatggtg tgtgctaaeg ataagtgtac acggaatttc attatceaaa ttaatattta 
tttgagaaaa ggateatgtt atatagagat atttccttat aatatttgtt ceaegtteat 
50 aatttttgaa tgatacatta eaacaaaaac tgtcaeaaat ttatatgtte tacataaaat 
atatggttaa gaacctaaga agttatgaat caagtaatag gataaaactg aaaaaggaag 
tgtaggtaca atgaataaaa aaataagaaa tgaagatgag catteatega tagaattate 
atatagtact tcaaaaaatc aaaageataa ggtaceattt tgttgtaeaa tttcag 



55 



3264 

3312 

3360 

3408 

3456 

3504 

3552 

3600 

3648 

3697 

3757 
3817 
3877 
3937 
3997 
4057 
4117 
4173 
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<210> 38 
<211> 1228 
<212> PRT 
5 <213> Bacillus thuringiensis 
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20 
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Asn Lys Gin Pro Asp 


Phe 


He Ser Thr 


Asn 


Glu 


Gin 












725 




730 






735 






Ser 


Asn 


Phe 


Thr 


Ser He His Glu 


Gin 


Ser 


Glu His Gly Trp Trp Gly 










740 




745 






750 






45 


Ser 


Glu 


Asn 


He 


Thr He Gin Glu Gly Asn 


Asp Val Phe 


Lys 


Glu 


Asn 








755 




760 






765 










Tyr 


Val 


Thr 


Leu 


Pro Gly Thr Phe 


Asn 


Glu 


Cys Tyr Pro 


Thr 


Tyr 


Leu 




770 






775 






780 










Tyr 


Gin 


Lys 


He 


Gly Glu Ser Glu 


Leu 


Lys 


Ala Tyr Thr 


Arg 


Tyr 


Gin 


50 


785 








790 






795 






800 




Leu 


Arg 


Gly 


Tyr 


He Glu Asp Ser 
805 


Gin 


Asp 
810 


Leu Glu He 


Tyr 


Leu 
815 


He 




Arg 


Tyr 


Asn 


Ala 


Lys His Glu Thr 


Leu Asp 


Val Pro Gly Thr 


Glu 


Ser 






820 




825 






830 






55 


Val 


Trp 


Pro 


Leu 


Ser Val Glu Ser 


Pro 


He 


Gly Arg Cys 


Gly Glu 


Pro 






835 




840 






845 










Asn Arg 


Cys 


Ala 


Pro His Phe Glu Trp Asn 


Pro Asp Leu Asp 


Cys 


ser 
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850 855 860 





Cys 




AsD 


Gly Glu 


Lys 


Cys 


Ala 


His 


His 


Ser 


His 


His 


Phe 


Ser 


Leu 




DOS 








870 










875 










880 




Asp 


lie 


Asp 


He Gly 


Cys 


Thr 


Asp 


Leu 


His 


Glu 


Asn 


Leu 


Gly 


Val 


Trp 


3 






885 










890 










895 






vcix 


Val 


Phe 


Lys Xle 

-7 V 


Lys 


Thr 


Gin 


Glu 
905 


Gly 


His 


Ala 


Arg 


Leu 
910 


Gly 


Asn 




Leu 


Glu 


Phe 


He Glu 


Glu 


Lys 


Pro 
920 


Leu 


Leu 


Gly 


Glu 


Ala 
925 


Leu 


Ser 


Arg 


lU 


Val 


Lys 
930 




Ala Glu 


Lys 


Lys 
935 


Trp 


Arci 


Asp 


Lys 


Arg 
940 


Glu 


Lvs 


Leu 


Gin 




Leu 


Glu 


Thr 


Lys Arg 


Val 


Tyr 


Thr 


Glu 


Ala 


Lys 


Glu 


Ala 


Val 


Asp 


Ala 




945 








950 










955 










960 




Leu 


Phe 


Val 


Asp Ser 


Gin Tyr 


Asn 


Arg 


Leu 


Gin 


Ala 


Asp 


Thr 


Asn 


He 








965 










970 










975 






Gly 


Met 


He 


His Ala 


Ala Asp 


Lys 


Leu 


Val 


His 


Arg 


He 


Arg 


Glu 


Ala 








980 








985 










990 








Tyr 


Leu 


Ser 


Glu Leu 


Ser 


Val 


He 


Pro 


Gly 


Val 


Asn 


Ala 


Glu 


He 


Phe 






995 






1000 








1005 








20 


Glu 


Glu 


Leu 


Glu Gly 


Arg 


He 


He 


Thr 


Ala 


He 


Ser 


Leu 


Tyr 


Asp 


Ala 




1010 






1015 








1020 












Arg 


Asn 


Val 


Val Lys 


Asn Gly Asp 


Phe 


Asn 


Asn Gly Leu 


Ala 


Cys 


Trp 




1025 




1030 








1035 








1040 




Asn 


Val 


Lys 


Gly His Val Asp Val 


Gin 


Gin 


Ser 


His 


His 


Arg 


Ser 


Val 


25 








1045 








1050 








1055 






Leu 


Val 


He 


Pro Glu Trp 


Glu 


Ala 


Glu 


Val 


Ser 


Gin 


Ala 


Val 


Arg 


val 



1060 1065 1070 



Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly 
1075 1080 1085 

30 Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp 
1090 1095 1100 

Glu Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp 
1105 1110 1115 1120 

Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Ala Cys 
35 1125 1130 1135 

Asn Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr 

1140 1145 1150 

Ala Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp 
1155 1160 1165 

40 Val Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr 
1170 1175 1180 

Pro Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro 
1185 1190 1195 1200 

Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr Glu Gly Thr Phe 
45 1205 1210 1215 

He Val Asp Ser He Glu Leu Leu Leu Met Glu Glu 
1220 1225 



<210> 39 
<211> 3504 
<212> DNA 

<213> Bacillus thuringiensis 



<220> 
<221> CDS 
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<222> (1)..(3504) 
<400> 39 

atg gag aga aat aat cag gat caa tgc att cct tat aat tgt tta aat 
5 Met Glu Arg Asn Asn Gin Asp Gin Cys lie Pro Tyr Asn Cys Leu Asn 
15 10 15 

aat cct gag att gag ata tta gat gtt gaa aat ttc aat etc gaa ctt 
Asn Pro Glu lie Glu lie Leu Asp Val Glu Asn Phe Asn Leu Glu Leu 
10 20 25 30 



20 get eta ggc gtc gat caa tgg age tta ttc ctt gcg caa att gag caa 
Ala Leu Gly Val Asp Gin Trp Ser Leu Phe Leu Ala Gin lie Glu Gin 
65 70 75 80 

tta att aat gaa agg ata aca aca gtt gaa agg aat aga gca att caa 
25 Leu lie Asn Glu Arg lie Thr Thr Val Glu Arg Asn Arg Ala lie Gin 

85 90 95 



48 



96 



gta teg caa gtc agt gtg gga ctt aca cgt ttt ctt eta gag tea get 144 
Val Ser Gin Val Ser Val Gly Leu Thr Arg Phe Leu Leu Glu Ser Ala 
35 40 45 

15 

gtc cea gga gcg ggt ttt gca ett gge eta ttc gat ate att tgg gga 
Val Pro Gly Ala Gly Phe Ala Leu Gly Leu Phe Asp lie lie Trp Gly 
50 55 60 



192 



240 



288 



aca tta agt gga eta teg agt agt tat gaa gta tat att gag gca tta 336 
Thr Leu Ser Gly Leu Ser Ser Ser Tyr Glu Val Tyr lie Glu Ala Leu 
30 100 105 110 

aga gaa tgg gag aat aat eca gat aat cea get tea caa gag aga gtg 384 
Arg Glu Trp Glu Asn Asn Pro Asp Asn Pro Ala Ser Gin Glu Arg Val 
115 120 125 

35 

cgt* aca ega ttt cgt aca aeg gae gac get eta ata aca get ata cct 
Arg Thr Arg Phe Arg Thr Thr Asp Asp Ala Leu lie Thr Ala lie Pro 
130 135 140 

40 aat tta gcg att cea gat ttt gag ata get act tta tea gtg tat gtt 
Asn Leu Ala lie Pro Asp Phe Glu He Ala Thr Leu Ser Val Tyr Val 
145 150 155 160 

caa gca gee aat eta eat eta tct tta tta aga gat get gtt tae ttt 
45 Gin Ala Ala Asn Leu His Leu Ser Leu Leu Arg Asp Ala Val Tyr Phe 

165 170 175 

gga gaa aga tgg gga etc aca caa gta aat att gaa gat ctt tat acg 
Gly Glu Arg Trp Gly Leu Thr Gin Val Asn He Glu Asp Leu Tyr Thr 
50 180 185 190 

aga tta aca aga aat att cat att tat tea gat cat tgt gca agg tgg 624 
Arg Leu Thr Arg Asn He His He Tyr Ser Asp His Cys Ala Arg Trp 
195 200 205 

55 

tat aat caa ggt tta aat aat att gga gca aca aat acg aga tat ttg 672 
Tyr Asn Gin Gly Leu Asn Asn He Gly Ala Thr Asn Thr Arg Tyr Leu 



432 



480 



528 



576 
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35 ggt ttt att tat aga aca tta tea tec cct att tat gta oca cct tct 
Gly Phe lie Tyr Arg Thr Leu Ser Ser Pro lie Tyr Val Pro Pro Ser 
355 360 365 



720 



768 



-45- 



210 215 220 

gaa ttc caa aga gaa tta aca etc tct gtc tta gat att gtg gcc ctt 

Glu Phe Gin Arg Glu Leu Thr Leu Ser Val Leu Asp lie Val Ala Leu 

5 225 230 235 240 

ttc ccg aat tac gac ate ega aea tat tea att ecg aca caa agt caa 

Phe Pro Asn Tyr Asp lie Arg Thr Tyr Ser lie Pro Thr Gin Ser Gin 

245 250 255 

10 

tta aca agg gag att tat acc gat ata att get gca ccc aat gca tea 816 

Leu Thr Arg Glu lie Tyr Thr Asp lie lie Ala Ala Pro Asn Ala Ser 
260 265 270 

I 15 aat tta ata gtg gga acg caa ggc eta gtg aga gca cct cac tta atg 864 

Asn Leu lie Val Gly Thr Gin Gly Leu Val Arg Ala Pro His Leu Met 
275 280 285 

gac ttt tta gtc cgt ttg aat att tat act ggc ttg get aga aat att 912 

20 Asp Phe Leu Val Arg Leu Asn lie Tyr Thr Gly Leu Ala Arg Asn lie 
290 295 300 

egt cat tgg gca gga cat gaa gta ata tct aga aga aca ggt gga gtg 960 

Arg His Trp Ala Gly His Glu Val lie Ser Arg Arg Thr Gly Gly Val 

25 305 310 315 320 

gat tta aat act ata caa tct ect tta tat gga aea get gca act aca 1008 

Asp Leu Asn Thr lie Gin Ser Pro Leu Tyr Gly Thr Ala Ala Thr Thr 

325 330 335 

30 

gaa agt eca cgt tta ata att ect ttt aat gag gat tct tat ctt ggt 1056 

Glu Ser Pro Arg Leu lie lie Pro Phe Asn Glu Asp Ser Tyr Leu Gly 
340 345 350 



1104 



gga att teg agt caa aga aca tct tta gtg gag ggt gtg gga ttt cag 1152 

40 Gly He Ser Ser Gin Arg Thr Ser Leu Val Glu Gly Val Gly Phe Gin 

370 375 380 

aca ccg aat aac tea ata ctt caa tac aga caa cgt gga aca ttg gat 1200 

Thr Pro Asn Asn Ser He Leu Gin Tyr Arg Gin Arg Gly Thr Leu Asp 

45 385 390 395 400 

tee ctt gag caa gta eca ctt caa gaa gag ggg aga eca ggc ggt ttt 1248 

Ser Leu Glu Gin Val Pro Leu Gin Glu Glu Gly Arg Pro Gly Gly Phe 
405 410 415 

50 

ggt get agt cat aga ttg tgt eat get aca ttt get caa tea cct ata 1296 

Gly Ala Ser His Arg Leu Cys His Ala Thr Phe Ala Gin Ser Pro He 
420 425 430 

55 ggt act aac tat tat ata agg gca ecg ttg ttt tct tgg acg eat etg 1344 

Gly Thr Asn Tyr Tyr He Arg Ala Pro Leu Phe Ser Trp Thr His Leu 
435 440 445 
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agt gca act ctt act aat gaa gtt cgt gta tct aga att aca caa tta 1392 

Ser Ala Thr Leu Thr Asn Glu Val Arg Val Ser Arg He Thr Gin Leu 

450 455 460 

s 

ccg atg gtg aag gcg cat acg ctt cat gcg gga get act gtt gtt aga 1440 

Pro Met Val Lys Ala His Thr Leu His Ala Gly Ala Thr Val Val Arg 

465 470 475 480 

10 gga cca gga ttt aca gga gga gat ata etc cga aga act act tea ggc 1488 
Gly Pro Gly Phe Thr Gly Gly Asp He Leu Arg Arg Thr Thr Ser Gly 
485 490 495 

tea ttt ggg gat atg aga ata aca aat ttt tea agt tea tea teg agg 1536 
15 Ser Phe Gly Asp Met Arg He Thr Asn Phe Ser Ser Ser Ser Ser Arg 
500 505 510 

tat cgt gta aga ata cgt tat get tct act aca gat tta caa ttt tte 1584 
Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr Asp Leu Gin Phe Phe 
20 515 520 525 

ttg aat gtt gga gga ace cct gtc aat gta gcc gat ttc ccg aaa acc 1632 
Leu Asn Val Gly Gly Thr Pro Val Asn Val Ala Asp Phe Pro Lys Thr 
530 535 540 

25 

ata gat aga ggg gaa aac tta gaa tat gga age ttt aga acg gca ggt 1680 
He Asp Arg Gly Glu Asn Leu Glu Tyr Gly Ser Phe Arg Thr Ala Gly 
545 550 555 560 

30 ttt act ace cct ttt agt ttt gta agt tea aca aat aat tte aca tta 1728 
Phe Thr Thr Pro Phe Ser Phe Val Ser Ser Thr Asn Asn Phe Thr Leu 
565 570 575 

ggt gtt eag agt gtt tct tea ggt aac gag att ttt gta gat cga att 1776 
35 Gly Val Gin Ser Val Ser Ser Gly Asn Glu He Phe Val Asp Arg He -^^ 



580 585 590 

gaa ttt gtt ccg gca gat gca acc ttt gag gca gaa tat gat tta gaa 
Glu Phe Val Pro Ala Asp Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu 
40 595 600 605 

aga gcg caa gag gcg gtg aat get ctg ttt act tct acg aat caa aga 

Arg Ala Gin Glu Ala Val Asn Ala Leu Phe Thr Ser Thr Asn Gin Arg 

610 615 620 

45 

gga ctg aaa aca gat gtg acg gat tat eat att gat caa gtg tee aat 

Gly Leu Lys Thr Asp Val Thr Asp Tyr His He Asp Gin Val Ser Asn 

625 630 635 640 

50 tta gtg gat tgt tta tec gat gaa ttc tgt eta gat gaa aaa aga gaa 1968 
Leu Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu 
645 650 655 



1824 



1872 



1920 



ttg tec gaa aaa att aaa cat gca aag cga etc agt gat gag egg aat 
55 Leu Ser Glu Lys He Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn 
660 665 670 



2016 
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tta etc caa gat tea aac ttt aga ggc ate aat aga caa cca gat cgt 
Leu Leu Gin Asp Ser Asn Phe Arg Gly lie Asn Arg Gin Pro Asp Arg 
675 680 685 



ttc aaa gaa aat tae gte aca eta cca ggt acc ttt gat gag tgc tat 
Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr 
705 710 715 720 

cca aca tat ttg tat caa aaa ate gat gaa tea aaa tta aaa gee ttt 
Pro Thr Tyr Leu Tyr Gin Lys lie Asp Glu Ser Lys Leu Lys Ala Phe 
725 730 735 



ate tat tta att cge tac aat gca aaa eat gaa aca gta aat gtg cca 
lie Tyr Leu lie Arg Tyr Asn Ala Lys His Glu Thr Val Asn Val Pro 
755 760 765 



gaa aaa ttg gaa tgg gaa aca aat ate gtt tat aaa gag gca aaa gaa 

Glu Lys Leu Glu Trp Glu Thr Asn lie Val Tyr Lys Glu Ala Lys Glu 
885 890 895 

tct gta gat get tta ttt gta aac tct caa tat gat caa tta caa geg 



2064 



ggc tgg aga gga agt aeg gat att act ate caa gga gga aat gac gta 2112 
Gly Trp Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val 
690 695 ' 700 



2160 



2208 



acc cgt tat caa tta aga ggg tat ate gaa gat agt caa gac tta gaa 2256 
Thr Arg Tyr Gin Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu 
740 745 750 



2304 



2400 



ggt aeg ggt tec tta tgg ccg ett tea gee caa agt cca ate gga aag 2352 

Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gin Ser Pro He Gly Lys 
770 775 780 

tgt gga gag ccg aat cga tgc gcg cca cac ctt gaa tgg aat cct gac 

Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp 
785 790 795 800 

tta gat tgt teg tgt agg gat gga gaa aag tgt gee cat cat teg cat 2448 

Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His 

805 810 815 

cat ttc tec tta gac att gat gta gga tgt aca gac tta aat gag gac 2496 
His Phe Ser Leu Asp He Asp Val Gly Cys Thr Asp Leu Asn Glu Asp 
820 825 830 

eta ggt gta tgg gtg ate ttt aag att aag aeg caa gat ggg cac gca 2544 
Leu Gly Val Trp Val He Phe Lys He Lys Thr Gin Asp Gly His Ala 
835 840 845 

aga eta ggg aat eta gag ttt etc gaa gag aaa cca tta gta gga gaa 2592 

Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu 
850 855 860 

geg eta get cgt gtg aaa aga gcg gag aaa aaa tgg aga gac aaa cgt 2640 

Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg 
865 870 875 880 



2688 



2736 
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Ser Val TVsp Ala Leu Phe Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala 
. 900 905 910 

gat acg aat att gcc atg att cat gcg gca gat aaa cgt gtt cat age 
Asp Thr Asn He Ala Met He His Ala Ala Asp Lys Arg Val His Ser 
915 920 925 



2784 



att cga gaa get tat ctg cct gag etg tct gtg att ccg ggt gtc aat 2832 
He Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val TVsn 
10 930 935 940 



gcg get att ttt gaa gaa tta gaa ggg cgt att ttc act gca ttc tec 
Ala Ala He Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala Phe Ser 
945 950 955 960 

IS 

eta tat gat gcg aga aat gtc att aaa aat ggt gat ttt aat aat ggc 
Leu Tyr Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn Asn Gly 
965 970 975 



2880 



2928 



20 tta tec tgc tgg aac gtg aaa ggg cat gta gat gta gaa gaa caa aac 2976 
Leu Ser Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn 
980 985 990 

aac caa cgt teg gtc ett gtt gtt ccg gaa tgg gaa gca gaa gtg tea 3024 
25 Asn Gin Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser 
995 1000 1005 

caa gaa gtt cgt gtc tgt ccg ggt cgt ggc tat ate ett cgt gtc aca 3072 
Gin Glu Val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr 
30 1010 1015 1020 

gcg tac aag gag gga tat gga gaa ggt tgc gta acc att cat gag ate 3120 

Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He 
1025 1030 1035 1040 

35 

gag aac aat aca gac gaa ctg aag ttt age aac tgc gta gaa gag gaa 3168 

Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu Glu 
1045 1050 1055 

40 ate tat cca aat aac acg gta acg tgt aat gat tat act gta aat caa 3216 
He Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Val Asn Gin 
1060 1065 1070 

gaa gaa tac gga ggt gcg tac act tct cgt aat cga gga tat aac gaa 3264 
45 Glu Glu Tyr Gly Gly Ala Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu 
1075 1080 1085 

get cct tec gta cca get gat tat gcg tea gtc tat gaa gaa aaa teg 3312 
Ala Pro Ser Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser 
50 1090 1095 1100 

tat aca gat gga cga aga gag aat cct tgt gaa ttt aac aga ggg tat 3360 

Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr 

1105 1110 1115 1120 

55 

agg gat tac acg cca eta cca gtt ggt tat gtg aca aaa gaa tta gaa 3408 

Arg Asp Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr Lys Glu Leu Glu 
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1125 1130 1135 

tac ttc cca gaa acc gat aag gta tgg att gag att gga gaa acg gaa 3456 

Tyr Phe Pro Glu Thr Asp Lys Val Trp lie Glu He Gly Glu Thr Glu 
1140 1145 1150 

gga aca ttt ate gtg gac age gtg gaa tta etc ctt atg gag gaa tag 3504 
Gly Thr Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 1165 



<210> 40 
<211> 1167 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 40 

Met Glu Arg Asn Asn Gin Asp Gin Cys He Pro Tyr Asn Cys Leu Asn 

15 10 15 

Asn Pro Glu He Glu He Leu Asp Val Glu Asn Phe Asn Leu Glu Leu 

20 25 30 

val Ser Gin Val Ser Val Gly Leu Thr Arg Phe Leu Leu Glu Ser Ala 

35 40 45 

Val Pro Gly Ala Gly Phe Ala Leu Gly Leu Phe Asp He He Trp Gly 

50 55 60 

Ala Leu Gly Val Asp Gin Trp Ser Leu Phe Leu Ala Gin He Glu Gin 
65 70 75 80 

Leu He Asn Glu Arg He Thr Thr Val Glu Arg Asn Arg Ala He Gin 

85 90 95 

Thr Leu Ser Gly Leu Ser Ser Ser Tyr Glu Val Tyr He Glu Ala Leu 

100 105 110 

Arg Glu Trp Glu Asn Asn Pro Asp Asn Pro Ala Ser Gin Glu Arg Val 

115 120 125 

Arg Thr Arg Phe Arg Thr Thr Asp Asp Ala Leu He Thr Ala He Pro 

130 135 140 

Asn Leu Ala He Pro Asp Phe Glu He Ala Thr Leu Ser Val Tyr Val 
145 150 155 160 

Gin Ala Ala Asn Leu His Leu Ser Leu Leu Arg Asp Ala Val Tyr Phe 

165 170 175 

Gly Glu Arg Trp Gly Leu Thr Gin Val Asn He Glu Asp Leu Tyr Thr 

180 185 190 

Arg Leu Thr Arg Asn He His He Tyr Ser Asp His Cys Ala Arg Trp 

195 200 205 

Tyr Asn Gin Gly Leu Asn Asn He Gly Ala Thr Asn Thr Arg Tyr Leu 

210 215 220 

Glu Phe Gin Arg Glu Leu Thr Leu Ser Val Leu Asp He Val Ala Leu 
225 230 235 240 

Phe Pro Asn Tyr Asp He Arg Thr Tyr Ser He Pro Thr Gin Ser Gin 

245 250 255 

Leu Thr Arg Glu He Tyr Thr Asp He He Ala Ala Pro Asn Ala Ser 

260 265 270 

Asn Leu He Val Gly Thr Gin Gly Leu Val Arg Ala Pro His Leu Met 

275 280 285 

Asp Phe Leu Val Arg Leu Asn He Tyr Thr Gly Leu Ala Arg Asn He 

290 295 300 

Arg His Trp Ala Gly His Glu Val He Ser Arg Arg Thr Gly Gly Val 
305 310 315 320 
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Asp Leu Asn Thr lie Gin Ser Pro Leu Tyr Gly Thr Ala Ala Thr Thr 

325 330 335 

Glu Ser Pro Arg Leu lie lie Pro Phe Asn Glu Asp Ser Tyr Leu Gly 
340 345 350 

5 Gly Phe He Tyr Arg Thr Leu Ser Ser Pro He Tyr Val Pro Pro Ser 
355 360 365 

Gly He Ser Ser Gin Arg Thr Ser Leu Val Glu Gly Val Gly Phe Gin 

370 375 380 

Thr Pro Asn Asn Ser He Leu Gin Tyr Arg Gin Arg Gly Thr Leu Asp 
10 385 390 395 400 

Ser Leu Glu Gin Val Pro Leu Gin Glu Glu Gly Arg Pro Gly Gly Phe 

405 410 415 

Gly Ala Ser His Arg Leu Cys His Ala Thr Phe Ala Gin Ser Pro He 
420 425 430 

15 Gly Thr Asn Tyr Tyr He Arg Ala Pro Leu Phe Ser Trp Thr His Leu * 
435 440 445 

Ser Ala Thr Leu Thr Asn Glu Val Arg Val Ser Arg He Thr Gin Leu 

450 455 460 

Pro Met Val Lys Ala His Thr Leu His Ala Gly Ala Thr Val Val Arg 
20 465 470 475 480 

Gly Pro Gly Phe Thr Gly Gly Asp He Leu Arg Arg Thr Thr Ser Gly 

485 490 495 

Ser Phe Gly Asp Met Arg He Thr Asn Phe Ser Ser Ser Ser Ser Arg 
500 505 510 

25 Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr Asp Leu Gin Phe Phe 
515 520 525 

Leu Asn Val Gly Gly Thr Pro Val Asn Val Ala Asp Phe Pro Lys Thr 

530 535 540 

He Asp Arg Gly Glu Asn Leu Glu Tyr Gly Ser Phe Arg Thr Ala Gly 
30 545 550 555 560 

Phe Thr Thr Pro Phe Ser Phe Val Ser Ser Thr Asn Asn Phe Thr Leu 

565 570 575 

Gly Val Gin Ser Val Ser Ser Gly Asn Glu He Phe Val Asp Arg He 
580 585 590 

35 Glu Phe Val Pro Ala Asp Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu 

595 600 605 ^ 

Arg Ala Gin Glu Ala Val Asn Ala Leu Phe Thr Ser Thr Asn Gin Arg 

610 615 620 

Gly Leu Lys Thr Asp Val Thr Asp Tyr His He Asp Gin Val Ser Asn 
40 625 630 635 640 

Leu Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu 

645 650 655 

Leu Ser Glu Lys He Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn 
660 665 670 

45 Leu Leu Gin Asp Ser Asn Phe Arg Gly He Asn Arg Gin Pro Asp Arg 
675 680 685 

Gly Trp Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val 

690 695 700 

Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr 
50 705 710 715 720 

Pro Thr Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys Ala Phe 

725 730 735 

Thr Arg Tyr Gin Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu 
740 745 750 

55 He Tyr Leu He Arg Tyr Asn Ala Lys His Glu Thr Val Asn Val Pro 
755 760 765 

Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gin Ser Pro He Gly Lys 
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770 775 780 

Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp 
785 790 795 800 

Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser Hxs 
5 805 810 815 

His Phe Ser Leu Asp lie Asp Val Gly Cys Thr Asp Leu Asn Glu Asp 

820 825 830 

Leu Gly Val Trp Val He Phe Lys He Lys Thr Gin Asp Gly His Ala 
835 840 845 

10 Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu 
850 855 860 

Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg 
865 870 875 880 

Glu Lys Leu Glu Trp Glu Thr Asn He Val Tyr Lys Glu Ala Lys Glu 
; 15 885 890 895 

Ser Val Asp Ala Leu Phe Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala 

900 ^05 910 

Asp Thr Asn He Ala Met He His Ala Ala Asp Lys Arg Val His Ser 
915 920 925 

20 He Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn 
930 935 940 

Ala Ala He Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala Phe Ser 
945 950 955 960 

Leu Tyr Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn Asn Gly 
25 965 970 975 

Leu ser Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn 

980 985 990 

Asn Gin Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser 
995 1000 1005 

30 Gin Glu val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr 
1010 1015 1020 

Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He 
1025 1030 1035 1040 

Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu Glu 
35 1045 1050 1055 

He Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Val Asn Gin 

1060 1065 1070 

Glu Glu Tyr Gly Gly Ala Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu 
1075 1080 1085 

40 Ala Pro Ser Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser 
1090 1095 1100 

Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr 
1105 HIO HIS 1120 

Arg Asp Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr Lys Glu Leu Glu 
45 1125 1130 1135 

Tyr Phe Pro Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr Glu 

1140 1145 1150 

Gly Thr Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 1165 

50 



<210> 41 
<211> 2133 
55 <212> DMA 

<213> Bacillus 



thuringiensis 
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<220> 

<221> CDS 

<222> (1)..(2133) 

5 <400> 41 

atg aaa tct aag aat caa aat atg cat caa age ttg tct aac aat gcg 48 
Met Lys Ser Lys Asn Gin Asn Met His Gin Ser Leu Ser Aan Asn Ala 
15 10 15 

10 aca gtt gat aaa aac ttt aca ggt tea eta gaa aat aac aca aat acg 96 
Thr Val Asp Lys Asn Phe Thr Gly Ser Leu Glu Asn Asn Thr Asn Thr 
20 25 30 

gaa tta caa aac ttt aat cat gaa ggt ata gag ccg ttt gtt agt gta 144 
15 Glu Leu Gin Asn Phe Asn His Glu Gly lie Glu Pro Phe Val Ser Val 
35 40 45 

tea aca att caa acg ggt att ggt att get ggt aaa ate ctt ggt aac 192 
Ser Thr lie Gin Thr Gly He Gly He Ala Gly Lys He Leu Gly Asn 
20 50 55 60 

eta ggc gtt ect ttt get ggg caa gta get age etc tat agt ttt ate 240 
Leu Gly Val Pro Phe Ala Gly Gin Val Ala Ser Leu Tyr Ser Phe He 
65 70 75 80 

25 

eta ggt gag ctt tgg ccc aaa ggg aaa age caa tgg gaa ate ttt atg 288 
Leu Gly Glu Leu Trp Pro Lys Gly Lys Ser Gin Trp Glu He Phe Met 
85 90 95 

30 gaa eat gta gaa gag ctt att aat caa aag ata teg act tat gca aga 336 
Glu His Val Glu Glu Leu He Asn Gin Lys He Ser Thr Tyr Ala Arg 
100 105 110 

aac aaa gca ctt gca gat tta aaa gga tta gga gat get ttg get gte 384 
35 Asn Lys Ala Leu Ala Asp Leu Lys Gly Leu Gly Asp Ala Leu Ala Val 
115 120 125 

tac eat gaa teg ctg gaa agt tgg att gaa aat cgc aat aac aca aga 432 
Tyr His Glu Ser Leu Glu Ser Trp He Glu Asn Arg Asn Asn Thr Arg 
40 130 135 140 

acc aga agt gtt gte aag age caa tac ate aec ttg gaa ctt atg ttc 480 
Thr Arg Ser Val Val Lys Ser Gin Tyr He Thr Leu Glu Leu Met Phe 
145 150 155 160 

45 

gta caa tea tta cct tct ttt gca gtg tct gga gag gaa gta eca eta 528 
Val Gin Ser Leu Pro Ser Phe Ala Val Ser Gly Glu Glu Val Pro Leu 
165 170 175 

50 tta cea ata tat get caa get gca aat tta cae tta ttg eta tta ega 576 
Leu Pro He Tyr Ala Gin Ala Ala Asn Leu His Leu Leu Leu Leu Arg 
180 185 190 

gat get tct att ttt gga aaa taa tgg ggg tta tea gae tea gaa att 624 
55 Asp Ala Ser He Phe Gly Lys Xaa Trp Gly Leu Ser Asp Ser Glu He 
195 200 205 
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10 



tec aca ttt tat aat cgc caa tec gga aaa teg aaa gaa tat tct gac 
Ser Thr Phe Tyr Asn Arg Gin Ser Gly Lys Ser Lys Glu Tyr Ser Asp 
210 215 220 

cac tgc gta aaa tgg tat aat aca ggc eta aat cgc ttg atg ggg aac 
His Cys Val Lys Trp Tyr Asn Thr Gly Leu Asn Arg Leu Met Gly Asn 
225 230 235 240 

aat gee gaa agt tgg gta ega tat aat caa ttc cgt aga gac atg act 
Asn Ala Glu Ser Trp Val Arg Tyr Asn Gin Phe Arg Arg Asp Met Thr 
245 250 255 



20 ^ 

gac gea att ggg aca gta eat ccg cat cca agt ttt aca agt acg act 

Asp Ala lie Gly Thr Val His Pro His Pro Ser Phe Thr Ser Thr Thr 

290 295 300 

tgg tat aat aat aat gca cct teg ttc tct ace ata gag get get gtt 
Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Thr He Glu Ala Ala Val 
305 310 315 320 

gtt cga aac ccg cat eta etc gat ttt eta gaa caa gtt aca att tac 
Val Arg Asn Pro His Leu Leu Asp Phe Leu Glu Gin Val Thr He Tyr 
325 330 335 

age tta tta agt cga tgg agt aac act cag tat atg aat atg tgg gga 
Ser Leu Leu Ser Arg Trp Ser Asn Thr Gin Tyr Met Asn Met Trp Gly 
35 340 345 350 

gga cat aaa eta gaa ttc cga aca ata gga gga acg tta aat acc tea 
Gly His Lys Leu Glu Phe Arg Thr He Gly Gly Thr Leu Asn Thr Ser 
355 360 365 



25 



30 



40 



45 



50 



55 



aca caa gga tct act aat act tct att aat cct gta aca tta ccg ttc 
Thr Gin Gly Ser Thr Asn Thr Ser He Asn Pro Val Thr Leu Pro Phe 
370 375 380 

act tct cga gac gtc tat agg act gaa tea ttg gea ggg etg aat eta 
Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Leu Ala Gly Leu Asn Leu 
385 390 395 400 

ttt tta act caa cct gtt aat gga gta cct agg gtt gat ttt cat tgg 
Phe Leu Thr Gin Pro Val Asn Gly Val Pro Arg Val Asp Phe His Trp 
405 410 415 

aaa ttc gtc aca cat ccg ate gca tct gat aat ttc tat tat eca ggg 
Lys Phe Val Thr His Pro He Ala Ser Asp Asn Phe Tyr Tyr Pro Gly 
420 425 430 

tat get gga att ggg acg caa tta cag gat tea gaa aat gaa tta cca 



672 



720 



768 



816 



tta atg gta eta gat tta gtg gca eta ttt eca age tat gat aca caa 
Leu Met val Leu Asp Leu Val Ala Leu Phe Pro Ser Tyr Asp Thr Gin 
; 15 260 265 270 

atg tat cca att aaa act aca gee caa ett aca aga gaa gta tat aca 864 
Met Tyr Pro He Lys Thr Thr Ala Gin Leu Thr Arg Glu Val Tyr Thr 
275 280 285 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 
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Tyr Ala Gly lie Gly Thr Gin Leu Gin Asp Ser Glu Asn Glu lieu Pro 
435 440 445 

cct gaa gca aca gga cag cca aat tat gaa tct tat agt cat aga tta 1392 
5 Pro Glu Ala Thr Gly Gin Pro Asn Tyr Glu Ser Tyr Ser His Arg Leu 
450 455 460 

tct cat ata gga etc att tea gea tea cat gtg aaa gea ttg gta tat 1440 
Ser His lie Gly Leu lie Ser Ala Ser His Val Lys Ala Leu Val Tyr 
10 465 470 475 480 

tct tgg acg cat cgt agt gca gat cgt aca aat aca att gag cca aat 1488 
Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn Thr lie Glu Pro Asn 
485 490 495 

IS 

age att aca caa ata eca tta gta aaa gcg ttc aat ctg tct tea ggt 1536 
Ser lie Thr Gin lie Pro Leu Val Lys Ala Phe Asn Leu Ser Ser Gly 
500 505 510 

20 gee get gta gtg aga gga cca gga ttt aca ggt ggg gat ate ctt ega 1584 
Ala Ala Val Val Arg Gly Pro Gly Phe Thr Gly Gly Asp lie Leu Arg 
515 520 525 

aga aag aat act ggt aca ttt ggg gat ata cga gta aat att aat cca 1632 
25 Arg Lys Asn Thr Gly Thr Phe Gly Asp He Arg Val Asn lie Asn Pro 
530 535 540 

cca ttt gca caa aga tat egc gtg agg att egc tat get tct acc aca 1680 
Pro Phe Ala Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
30 545 550 555 560 

gat tta caa ttc cat acg tea att aac ggt aaa get att aat caa ggt 1728 
Asp Leu Gin Phe His Thr Ser lie Asn Gly Lys Ala He Asn Gin Gly 
565 570 575 

35 

aat ttt tea gea act atg aat aga gga gag gae tta gae tat aaa ace 1776 
Asn Phe Ser Ala Thr Met Asn Arg Gly Glu Asp Leu Asp Tyr Lys Thr 
580 585 590 

40 ttt aga act gta ggc ttt acc acc cca ttt age ttt tea gat gta caa 1824 
Phe Arg Thr Val Gly Phe Thr Thr Pro Phe Ser Phe Ser Asp Val Gin 
595 600 605 

agt aca ttc aca ata ggt get tgg aac ttc tct tea ggt aac gaa gtt 1872 
45 Ser Thr Phe Thr He Gly Ala Trp Asn Phe Ser Ser Gly Asn Glu Val 
610 615 620 

tat ata gat aga att gaa ttt gtt ccg gta gaa gta aca tat gag gca 1920 
Tyr He Asp Arg He Glu Phe Val Pro Val Glu Val Thr Tyr Glu Ala 
50 625 630 635 640 

gaa tat gat ttt gaa aaa gcg caa gag gag gtt act gca ctg ttt aca 1968 
Glu Tyr Asp Phe Glu Lys Ala Gin Glu Glu Val Thr Ala Leu Phe Thr 
645 650 655 

55 

tct acg aat cca aga gga tta aaa aca gat gta aag gat tat cat att 2016 
Ser Thr Asn Pro Arg Gly Leu Lys Thr Asp Val Lys Asp Tyr His He 
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).5 



20 



50 



55 



660 



.55- 



665 670 



gac cag gta tea aat tta gta gag tct eta tea gat aaa ttc tat ctt 
Asp Gin Val Ser Asn Leu Val Glu Ser Leu Ser Asp Lys Phe Tyr Leu 
675 680 685 

gat gaa aag aga gaa tta ttc gag ata gtt aaa tac gcg aag caa etc 
Asp Glu Lys Arg Glu Leu Phe Glu He Val Lys Tyr Ala Lys Gin Leu 
690 695 700 

eat att gag cgt aac atg tag 
His He Glu Arg Asn Met 
705 710 



<210> 42 
<211> 710 
<212> PRT 

<213> Bacillus thuringiensis 



<400> 42 

Met Lys Ser Lys Asn Gin Asn Met His Gin Ser Leu Ser Asn Asn Ala 

1 5 10 15 

Thr Val Asp Lys Asn Phe thr Gly Ser Leu Glu Asn Asn Thr Asn Thr 
25 20 25 30 

Glu Leu Gin Asn Phe Asn His Glu Gly He Glu Pro Phe Val Ser Val 

35 40 . 45 . 

Ser Thr He Gin Thr Gly He Gly He Ala Gly Lys He Leu Gly Asn 
50 55 60 

30 Leu Gly Val Pro Phe Ala Gly Gin Val Ala Ser Leu Tyr Ser Phe He 
65 70 75 80 

Leu Gly Glu Leu Trp Pro Lys Gly Lys Ser Gin Trp Glu He Phe Met 

85 90 95 

Glu His Val Glu Glu Leu He Asn Gin Lys He Ser Thr Tyr Ala T^g 
35 100 105 110 

Asn Lys Ala Leu Ala Asp Leu Lys Gly Leu Gly Asp Ala Leu Ala Val 

115 120 125 

Tyr His Glu Ser Leu Glu Ser Trp He Glu Asn Arg Asn Asn Thr Arg 
130 135 140 

40 Thr Arg Ser Val Val Lys Ser Gin Tyr He Thr Leu Glu Leu Met Phe 
145 150 155 160 

Val Gin Ser Leu Pro Ser Phe Ala Val Ser Gly Glu Glu Val Pro Leu 

165 170 175 

Leu Pro He Tyr Ala Gin Ala Ala Asn Leu His Leu Leu Leu Leu Arg 
45 IBO 185 190 

Asp Ala Ser He Phe Gly Lys Xaa Trp Gly Leu Ser Asp Ser Glu He 

195 200 205 

Ser Thr Phe Tyr Asn Arg Gin Ser Gly Lys Ser Lys Glu Tyr Ser Asp 

210 215 220 

His Cys Val Lys Trp Tyr Asn Thr Gly Leu Asn Arg Leu Met Gly Asn 
225 230 235 240 

Asn Ala Glu Ser Trp Val Arg Tyr Asn Gin Phe Arg Arg Asp Met Thr 

245 250 255 

Leu M t Val Leu Asp Leu Val Ala Leu Phe Pro Ser Tyr Asp Thr Gin 

260 265 270 

Met Tyr Pro He Lys Thr Thr Ala Gin Leu Thr Arg Glu Val Tyr Thr 
275 280 285 



2064 



2112 



2133 
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Asp Ala He Gly Thr Val His Pro His Pro Ser Phe Thr Ser Thr Thr 

290 295 300 

Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Thr He Glu Ala Ala Val 
305 310 315 320 

5 Val Arg Asn Pro His Leu Leu Asp Phe Leu Glu Gin Val Thr He Tyr 

325 330 335 

Ser Leu Leu Ser Arg Trp Ser Asn Thr Gin Tyr Met Asn Met Trp Gly 

340 345 350 

Gly His Lys Leu Glu Phe Arg Thr He Gly Gly Thr Leu Asn Thr Ser 
JO 355 360 365 

Thr Gin Gly Ser Thr Asn Thr Ser He Asn Pro Val Thr Leu Pro Phe 

370 375 380 

Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Leu Ala Gly Leu Asn Leu 
385 390 395 400 

15 Phe Leu Thr Gin Pro Val Asn Gly Val Pro Arg Val Asp Phe His Trp 

405 410 415 

Lys Phe Val Thr His Pro He Ala Ser Asp Asn Phe Tyr Tyr Pro Gly 

420 425 430 

Tyr Ala Gly He Gly Thr Gin Leu Gin Asp Ser Glu Asn Glu Leu Pro 
20 435 440 445 

Pro Glu Ala Thr Gly Gin Pro Asn Tyr Glu Ser Tyr Ser His Arg Leu 

450 455 460 

Ser His He Gly Leu He Ser Ala Ser His Val Lys Ala Leu Val Tyr 
465 470 475 480 

25 Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn Thr He Glu Pro Asn 

485 490 495 

Ser He Thr Gin He Pro Leu Val Lys Ala Phe Asn Leu Ser Ser Gly 

500 505 510 

Ala Ala Val Val Arg Gly Pro Gly Phe Thr Gly Gly Asp He Leu Arg 
30 515 520 525 

Arg Lys Asn Thr Gly Thr Phe Gly Asp He Arg Val Asn He Asn Pro 

530 535 540 

Pro Phe Ala Gin Arg Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
545 550 555 560 

35 Asp Leu Gin Phe His Thr Ser He Asn Gly Lys Ala He Asn Gin Gly 

565 570 575 

Asn Phe Ser Ala Thr Met Asn Arg Gly Glu Asp Leu Asp Tyr Lys Thr 

580 585 590 

Phe Arg Thr Val Gly Phe Thr Thr Pro Phe Ser Phe Ser Asp Val Gin 
40 5 9 5 6 0 0 6 05 

Ser Thr Phe Thr He Gly Ala Trp Asn Phe Ser Ser Gly Asn Glu Val 

610 615 620 

Tyr He Asp Arg He Glu Phe Val Pro Val Glu Val Thr Tyr Glu Ala 
625 630 635 640 

45 Glu Tyr Asp Phe Glu Lys Ala Gin Glu Glu Val Thr Ala Leu Phe Thr 

645 650 655 

Ser Thr Asn Pro Arg Gly Leu Lys Thr Asp Val Lys Asp Tyr His He 

660 665 670 

Asp Gin Val Ser Asn Leu Val Glu Ser Leu Ser Asp Lys Phe Tyr Leu 
50 675 680 685 

Asp Glu Lys Arg Glu Leu Phe Glu He Val Lys Tyr Ala Lys Gin Leu 

690 695 700 

His He Glu Arg Asn Met 
705 710 

55 
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<210> 43 
<211> 218 
<212> DNA 

<213> Bacillus thuringiensis 

5 

<400> 43 

gtagccgatt tcccgaaaac catagataga 
acggcaggtt ttactacccc ttttagtttt 
gttcagagtg tttcttcagg taacgagatt 
10 gatgcaacct ttgaggcaga atatgattta 



ggggaaaact tagaatatgg aagctttaga 60 

gtaagttcaa caaataattt cacattaggt 120 

tttgtagatc gaattgaatt tgttccggca 180 

gaaagagc 218 



<210> 44 
<211> 72 
7 15 <212> PRT 

c213> Bacillus thuringiensis 

<400> 44 

Val Ala Asp Phe Pro Lys Thr lie Asp Arg Gly Glu Asn Leu Glu Tyr 
20 1 5 10 15 

Gly Ser Phe Arg Thr Ala Gly Phe Thr Thr Pro Phe Ser Phe Val Ser 
20 25 30 

25 Ser Thr Asn Asn Phe Thr Leu Gly Val Gin Ser Val Ser Ser Gly Asn 
35 40 45 

Glu lie Phe Val Asp Arg He Glu Phe Val Pro Ala Asp Ala Thr Phe 
50 55 60 

30 

Glu Ala Glu Tyr Asp Leu Glu Arg 
65 70 



35 <210> 45 

<211> 1908 
<212> DNA 

<213> Bacillus thuringiensis 

40 <220> 

<221> CDS 

<222> (1) . . (1908) 

<400> 45 

45 atg aat aat gta ttg aat age gga aaa aca act att tgt aat gcg tat 48 

Met Asn Asn Val Leu Asn Ser Gly Lys Thr Thr He Cys Asn Ala Tyr 

1 5 10 . 15 

aat gta gtg get cac gat cca ttt agt ttt gaa cat aaa tea tta gat 96 
50 Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asp 
20 25 30 

acc ate caa gaa gaa tgg atg gag tgg aaa aga aca gat eat agt tta 144 
Thr He Gin Glu Glu Trp Met Glu Trp Lys Arg Thr Asp His Ser Leu 
55 35 40 45 



tat gta get cet gta gte gga act gtg tct agt ttt ctg eta aag aaa 



192 
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Tyr Val Ala Pro Val Val Gly Thr Val Ser Ser Phe Leu Leu Lys Lys 
50 55 60 

gtg ggg agt eta att gga aaa agg ata ttg agt gaa tta tgg ggg tta 240 
5 Val Gly Ser Leu lie Gly Lys Arg lie Leu Ser Glu Leu Trp Gly Leu 
65 70 75 80 

ata ttt cct agt ggt agt aca aat eta atg caa gat att tta aga gag 288 
lie Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp lie Leu Arg Glu 
10 85 90 95 

aca gaa caa ttc eta aat caa aga ett aat aca gac acc ctt gat cgt 336 
Thr Glu Gin Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Asp Arg 
100 105 110 

ts 

gta aat gca gaa ttg gaa ggg etc caa geg aat ata agg gag ttt aat 384 
Val Asn Ala Glu Leu Glu Gly Leu Gin Ala Asn lie Arg Glu Phe Asn 
115 120 125 

20 eaa caa gta gat aat ttt tta aac cct act caa aac cct gtt cct tta 432 
Gin Gin Val Asp Asn Phe Leu Asn Pro Thr Gin Asn Pro Val Pro Leu 
130 135 140 

tea ata act tet tea gtt aat aca atg cag eaa tta ttt eta aat aga 480 
25 Ser lie Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn Arg 
145 150 155 160 

tta ccc cag ttc cag ata caa gga tac cag ttg tta tta tta cct tta 528 
Leu Pro Gin Phe Gin He Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 
30 165 170 175 

ttt gca cag gca gee aat atg eat ctt tet ttt att aga gat gtt att 576 
Phe Ala Gin Ala Ala Asn Met His Leu Ser Phe He Arg Asp Val He 
180 185 190 

35 

ctt aat gca gat gaa tgg ggc att tea gca gca- aca eta cgt aeg tat 624 
Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr Tyr 
195 200 205 

40 ega gac tac etg aga aat tat aca aga gat tat tet aat tat tgt ata 672 
Arg Asp Tyr Leu Arg Asn Tyr Thr Arg Asp Tyr Ser Asn Tyr Cys He 
210 215 220 

aat aeg tat eaa act gcg ttt aga ggg tta aac acc cgt tta cac gat 720 
45 Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
225 230 235 240 

atg tta gaa ttt aga aca tat atg ttt tta aat gta ttt gaa tat gta 768 
Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 
50 245 250 255 

tec att tgg tea ttg ttt aaa tat cag agt ctt atg gta tet tet ggc 816 
Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Met Val Ser Ser Gly 
260 265 270 

55 

get aat tta tat get agt ggt agt gga eca cag cag aca eaa tea ttt 864 
Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 
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275 280 285 

act gca caa aac tgg cca ttt tta tat tct ctt ttc caa gtt aat teg 912 
Thr Ala Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 
5 290 295 300 

aat tat ata tta tct ggt att agt ggt aat agg ctt tct act acc ttc 960 
Asn Tyr lie Leu Ser Gly He Ser Gly Asn Arg Leu Ser Thr Thr Phe 
305 310 315 320 

10 

cct aat att ggt ggt tta ccg ggt agt act aca att cat tea ttg aac 1008 
Pro Asn He Gly Gly Leu Pro Giy Ser Thr Thr lie His Ser Leu Asn 
325 330 335 

) 15 agt gcc agg gtt aat tat age gga gga gtt tea tct ggt etc ata ggg 1056 
Ser Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Leu He Gly 
340 345 350 

gcg act aat etc aat cac aac ttt aat tgc age acg gtc etc cct cct 1104 
20 Ala Thr Asn Leu Asn His Asn Phe Asn Cys Ser Thr Val Leu Pro Pro 
355 360 365 

tta tea aca cca ttt gtt aga agt tgg ctg gat tea ggt aca gat ega 1152 
Leu Ser Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Thr Asp Arg 
25 370 375 380 

gag gge gtt get acc tct acg act tgg cag aca gaa tec ttc caa ata 1200 
Glu Gly Val Ala Thr Ser Thr Thr Trp Gin Thr Glu Ser Phe Gin He 
385 390 395 400 

30 

act tea ggt tta agg tgt ggt get ttt cct ttt tea get egt gga aat 1248 
Thr Ser Gly Leu Arg Cys Gly Ala Phe Pro Phe Ser Ala Arg Gly Asn 
405 410 415 

35 tea aac tat ttc cca gat tat ttt ate cgt aat att tct ggg gtt cct 1296 
' Ser Asn Tyr Phe Pro Asp Tyr Phe He Arg Asn He Ser Gly Val Pro 
420 425 430 

tta gtt att aga aac gaa gat eta aca aga ccg tta cac tat aac caa 1344 
40 Leu Val He Arg Asn Glu Asp Leu Thr Arg Pro Leu His Tyr Asn Gin 
435 440 445 

ata aga aat ata gaa agt cct teg gga aca cct ggt gga tta cga get 1392 
He Arg Asn He Glu Ser Pro Ser Gly Thr Pro Gly Gly Leu Arg Ala 
45 450 455 460 

tat atg gta tct gtg cat aac aga aaa aat aat ate tat gcc get cat 1440 
Tyr Met Val Ser Val His Asn Arg Lys Asn Asn He Tyr Ala Ala His 
465 470 475 480 

SO 

gaa aat ggt act atg att cat ttg gca ccg gaa gat tat aca gga ttt 1488 
Glu Asn Gly Thr Met He His Leu Ala Pro Glu Asp Tyr Thr Gly Phe 
485 490 495 

55 act ata tea cca ata eat gcc act caa gtg aat aat caa act cga aca 1536 
Thr He Ser Pro He His Ala Thr Gin Val Asn Asn Gin Thr Arg Thr 
500 505 510 



wo 01/19859 



PCTAJSOO/25361 



-60- 



ttt att tct gaa aaa ttt gga aat caa ggt gat tec tta aga ttt gaa 1584 
Phe He Ser Glu Lys Phe Gly Asn Gin Gly Asp Ser Leu Arg Phe Glu 
515 520 525 

caa agt aac acg aca get cgt tat acg ctt aga ggg aat gga aat agt 1632 
Gin Ser Asn Thr Thr Ala Arg Tyr Thr Leu Arg Gly Asn Gly Asn Ser 
530 535 540 

10 tac aat ctt tat tta aga gta tct tea ata gga aat tea act ate cga 1680 
Tyr Asn Leu Tyr Leu Arg Val Ser Ser He Gly Asn Ser Thr He Arg 
545 550 555 560 

gtt act ata aac ggt agg gtt tat act get tea aat gtt aat act aat 1728 
15 Val Thr He Asn Gly Arg Val Tyr Thr Ala Ser Asn Val Asn Thr Asn - 

565 570 575 

aca aat aac gat ggg gtt aat gat aat gga get cgt ttt tea gat att 1776 
Thr Asn Asn Asp Gly Val Asn Asp Asn Gly Ala Arg Phe Ser Asp He 
20 580 585 590 

aat ate ggt aat gta gta gea agt gat aat act aat gta ccg tta gat 1824 
Asn He Gly Asn val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp 
595 600 605 

25 

ata aat gtg aca tta aac tec ggt act caa ttt gag ctt atg aat att 1872 
He Asn Val Thr Leu Asn Ser Gly Thr Gin Phe Glu Leu Met Asn He 
610 615 620 

30 atg ttt gtg eca act aat ctt eca cea ctt tat taa 1908 
Met Phe Val Pro Thr Asn Leu Pro Pro Leu Tyr 
625 630 635 



35 <210> 46 ^ 
<211> 635 
<212> PRT 

<213> Bacillus thuringiensis 



40 <400> 46 
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425 




Leu 


Val 


He 


Arg 


Asn Glu Asp 


Leu 


Thr 
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Asn He Gly Asn Val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp 

595 600 605 

He Asn Val Thr Leu Asn Ser Gly Thr Gin Phe Glu Leu Met Asn He 
610 615 620 

5 Met Phe Val Pro Thr Asn Leu Pro Pro Leu Tyr 
625 630 635 



10 <210> 47 

<211> 1878 
<212> DNA 

<213> Bacillus thuringiensis 

15 <220> 

<221> CDS 

<222> (1) . . (1878) 

<400> 47 

20 atg aat act gta ttg aat aac gga aga aat act act tgt cat gca cat : 48 
Met Asn Thr Val Leu Asn Asn Gly Arg Asn Thr Thr Cys His Ala His 
15 10 15 

aat gta gtt get cat gat cca ttt agt ttt gaa cat aaa tea tta aat 96 
25 Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asn 
20 25 30 



acc ata gaa aaa gaa tgg aaa gaa tgg aaa aga act gat cat agt tta 
Thr He Glu Lys Glu Trp Lys Glu Trp Lys Arg Thr Asp His Ser Leu 
30 35 40 45 



144 



tat gta gcc cct att gtg gga act gtg ggt agt ttt eta tta aag aaa 192 
Tyr Val Ala Pro He Val Gly Thr Val Gly Ser Phe Leu Leu Lys Lys 
50 55 60 



240 * 



35 ^% 
gta ggg agt ctt gtt gga aaa agg ata ctg agt gag tta cag aat tta 
Val Gly Ser Leu Val Gly Lys Arg He Leu Ser Glu Leu Gin Asn Leu 
65 70 75 80 



40 att ttt cct agt ggt agt ata gat tta atg caa gag att tta aga gcg 
He Phe Pro Ser Gly Ser He Asp Leu Met Gin Glu He Leu Arg Ala 
85 90 95 



288 



aca gaa caa ttc ata aat caa agg ctt aat gca gac acc ctt ggt cgt 336 
45 Thr Glu Gin Phe He Asn Gin Arg Leu Asn Ala Asp Thr Leu Gly Arg 
100 105 110 

gta aat gca gaa ttg gca ggt ctt caa gcg aat gtg gca gag ttt aat 384 
Val Asn Ala Glu Leu Ala Gly Leu Gin Ala Asn Val Ala Glu Phe Asn 
50 115 120 125 

cga caa gta gat aat ttt tta aac cct aat caa aac cct gtt cct tta 432 
Arg Gin Val Asp Asn Phe Leu Asn Pro Asn Gin Asn Pro Val Pro Leu 
130 135 140 

55 

gca ata att gat tea gtt aat aca ttg cag caa tta ttt eta agt aga 480 
Ala He He Asp Ser Val Asn Thr Leu Gin Gin Leu Phe Leu Ser Arg 
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145 



150 155 160 



5 



10 



tta cca cag ttc cag ata caa ggc tat caa ctg tta tta tta cct tta 
Leu Pro Gin Phe Gin He Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 
165 170 175 



528 



25 



30 



35 

/ 



40 



45 



50 



55 



ttt gca cag gca gcc aat tta cat ctt tot ttt att aga gat gtc ate 576 
Phe Ala Gin Ala Ala Asn Leu His Leu Ser Phe He Arg Asp Val He 
180 185 190 

ctt aat gca gat gaa tgg ggc att tea gca gca aca gta cgc aca tat 624 
Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Val Arg Thr Tyr 
195 200 205 



I 15 aga gat cac ctg aga aat ttc aca aga gat tac tct aat tat tgt ata 
Arg Asp His Leu Arg Asn Phe Thr Arg Asp Tyr Ser Asn Tyr Cys He 
210 215 220 

aat acg tat caa act gca ttt aga ggt tta aac act cgt tta cac gat 
20 Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
225 230 235 240 

atg tta gaa ttt aga aca tat atg ttt tta aat gta ttt gaa tat gtc 
Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 



672 



720 



768 



245 250 255 

tct ate tgg teg tta ttt aaa tat caa age ctt eta gta tct tec ggc 
ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 
260 265 270 

get aat tta tat gcg agt ggt agt ggt cca aca caa tea ttt aca gca 
Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Thr Gin Ser Phe Thr Ala 
275 280 285 

eaa aac tgg cca ttt tta tat tct ctt ttc caa gtt aat tct aat tat 
Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn. Ser Asn Tyr 
290 295 300 

gta tta aat ggt ttg agt ggt get agg acc acc att act ttc cct aat 
Val Leu Asn Gly Leu Ser Gly Ala Arg Thr Thr He Thr Phe Pro Asn 
305 310 315 320 

att ggt ggt ctt cec ggt tct acc aca act caa aca ttg cat ttt gcg 
He Gly Gly Leu Pro Gly Ser Thr Thr Thr Gin Thr Leu His Phe Ala 
325 330 335 

agg att aat tat aga ggt gga gtg tea tct age cgc ata ggt caa get 
Arg He Asn Tyr Arg Gly Gly Val Ser Ser Ser Arg He Gly Gin Ala 
340 345 350 

aat ctt aat caa aac ttt aac att tec aca ctt ttc aat cct tta caa 
Asn Leu Asn Gin Asn Phe Asn He Ser Thr Leu Phe Asn Pro Leu Gin 
355 360 365 

aca ccg ttt att aga agt tgg eta gat tct ggt aca gat egg gag ggc 1152 
Thr Pro Phe He Arg Ser Trp Leu Asp Ser Gly Thr Asp Arg Glu Gly 
370 375 380 



816 



864 



912 



960 



1008 



1056 



1104 
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gtt gcc acc t:ct aca aac tgg caa tea gga gcc ttt gag aca act tta 1200 
Val Ala Thr Ser Thr Asn Trp Gin Ser Gly Ala Phe Glu Thr Thr Leu 
385 390 395 400 

5 

tta cga ttt age att ttt tea get cgt ggt aat teg aae ttt ttc cca 1248 
Leu Arg Phe Ser lie Phe Ser Ala Arg Gly Asn Ser Asn Phe Phe Pro 
405 410 415 

10 gat tat ttt ate cgt aat att tet ggt gtt gtt ggg act att age aac 1296 
Asp Tyr Phe lie Arg Asn lie Ser Gly Val Val Gly Thr lie Ser Asn 
420 425 430 

gca gat tta gca aga cct eta eac ttt aat gaa ata aga gat ata gga 1344 
IS Ala Asp Leu Ala Arg Pro Leu His Phe Asn Glu lie Arg Asp lie Gly 
435 440 445 

acg aca gca gtc get age ctt gta aca gtg cat aac aga aaa aat aat 1392 
Thr Thr Ala Val Ala Ser Leu Val Thr Val His Asn Arg Lys Asn Asn 
20 450 455 460 

ate tat gac act cat gaa aat ggt act atg att eat tta gcg cca aat 1440 

lie Tyr Asp Thr His Glu Asn Gly Thr Met He His Leu Ala Pro Asn 
465 470 475 480 

25 

gac tat aca gga ttt acc gta tet cca ata cat gcc act caa gta aat 1488 

Asp Tyr Thr Gly Phe Thr Val Ser Pro He His Ala Thr Gin Val Asn , 
485 490 495 

30 aat caa att cga acg ttt att tec gaa aaa tat ggt aat cag ggt gat 1536 
Asn Gin He Arg Thr Phe He Ser Glu Lys Tyr Gly Asn Gin Gly Asp 
500 505 510 

tec ttg aga ttt gag eta age aac aca acg get cga tac aca ctt aga 1584 
35 Ser Leu Arg Phe Glu Leu Ser Asn Thr Thr Ala Arg Tyr Thr Leu Arg 
515 520 525 

ggg aat gga aat agt tac aat ctt tat tta aga gta tet tea ata gga 1632 
Gly Asn Gly Asn Ser Tyr Asn Leu Tyr Leu Arg Val Ser Ser He Gly 
40 530 535 540 

agt tec aca att cga gtt act ata aac ggt aga gtt tat act gca aat 1680 
Ser Ser Thr He Arg Val Thr He Asn Gly Arg Val Tyr Thr Ala Asn 
545 550 555 560 

45 

gtt aat act acc aca aat aat gat gga gta ctt gat aat gga get cgt 1728 
Val Asn Thr Thr Thr Asn Asn Asp Gly Val Leu Asp Asn Gly Ala Arg 
565 570 575 

50 ttt tea gat att aat ate ggt aat gta gtg gca agt get aat act aat 1776 
Phe Ser Asp He Asn He Gly Asn Val Val Ala Ser Ala Asn Thr Asn 
580 585 590 

gta cca tta gat ata caa gtg aca ttt aac gac aat cca caa ttt gag 1824 
55 Val Pro Leu Asp He Gin Val Thr Phe Asn Asp Asn Pro Gin Phe Glu 
595 600 605 
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ctt atg aat att atg ttg ttc caa eta ate ttc cac cac ttt att aag 1872 
Leu Met Asn lie Met Leu Phe Gin Leu lie Phe His His Phe He Lys 
610 615 620 

gtt tga ^^'^^ 

Val 

625 



10 <210> 48 
<211> 625 
<212> PRT 

<213> Bacillus thuringiensis 
) IS <400> 48 

Met Asn Thr Val Leu Asn Asn Gly Arg Asn Thr Thr Cys His Ala His 

15 10 15 

Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asn 
20 25 30 

20 Thr lie Glu Lys Glu Trp Lys Glu Trp Lys Arg Thr Asp His Ser Leu 
35 40 45 

Tyr Val Ala Pro He Val Gly Thr Val Gly Ser Phe Leu Leu Lys Lys 

50 55 60 

Val Gly Ser Leu Val Gly Lys Arg He Leu Ser Glu Leu Gin Asn Leu 
25 65 70 75 80 

He Phe Pro Ser Gly Ser He Asp Leu Met Gin Glu He Leu Arg Ala 

85 90 95 

Thr Glu Gin Phe He Asn Gin Arg Leu Asn Ala Asp Thr Leu Gly Arg 
100 105 110 

30 Val Asn Ala Glu Leu Ala Gly Leu Gin Ala Asn Val Ala Glu Phe Asn 
115 120 125 

Arg Gin Val Asp Asn Phe Leu Asn Pro Asn Gin Asn Pro Val Pro Leu 

130 135 140 

Ala He He Asp Ser Val Asn Thr Leu Gin Gin Leu Phe Leu Ser Arg 
. 35 145 150 155 160 

^ Leu Pro Gin Phe Gin He Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 

165 170 175 

Phe Ala Gin Ala Ala Asn Leu His Leu Ser Phe He Arg Asp Val He 
180 185 190 

40 Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Val Arg Thr Tyr 
195 200 205 

Arg Asp His Leu Arg Asn Phe Thr Arg Asp Tyr Ser Asn Tyr Cys He 

210 215 220 

Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
45 225 230 235 240 

Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 

245 250 255 

Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 
260 265 270 

50 Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Thr Gin Ser Phe Thr Ala 
275 280 285 

Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser Asn Tyr 

290 295 300 

Val Leu Asn Gly Leu Ser Gly Ala Arg Thr Thr He Thr Phe Pro Asn 
55 305 310 315 320 

He Gly Gly Leu Pro Gly Ser Thr Thr Thr Gin Thr Leu His Phe Ala 
325 330 335 
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He 
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He 


Ser 
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440 
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IS 


Thr 


Thr 
450 
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Val Ala Ser 
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455 
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Thr Val His 
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460 
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Asn 


Asn 




He 
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Thr Met He 
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Pro 
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465 


470 






475 








480 
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Thr 


Gly Phe Thr 


Val 


Ser 


Pro He His 
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485 






490 






495 
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He 
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He 


Ser 
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Gly Asn 


Gin Gly Asp 










500 






505 




510 








Ser 


Leu 


Arg 
515 


Phe Glu Leu 


Ser 
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Thr Thr Ala 
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525 


Thr 


Leu 


Arg 


25 


Gly 


Asn 


Gly 


Asn Ser Tyr Asn 


Leu 


Tyr Leu Arg 


Val Ser 


Ser 


He 


Gly 






530 
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540 










Ser 


Ser 


Thr 


He Arg Val 


Thr 


He 
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Thr 
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545 






550 
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Val 
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30 








565 
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575 






Phe 


Ser 


Asp 


He Asn He 
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580 






585 
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Val 


Pro 
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Asp He Gin 


Val 


Thr 


Phe Asn Asp Asn Pro 


Gin 


Phe 


Glu 








595 






600 




605 








35 


Leu 

Val 
625 


Met 
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Asn 


He Met Leu 


Phe 
615 
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Leu He Phe 


His His 
620 


Phe 


He 
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40 



<210> 49 
<211> 143 
<212> DNA 
45 <213> Bacillus thuringiensis 



<220> 

<221> modif ied_base 
<222> (8) . . (140) 
50 <223> N = A, T, C or G 



<400> 49 

gtcgtganag gnccaggatt tacaggaggg gatatactnc gaagaacggn cggtggtgca 60 

tttggaacna ttagngctan ggctMtgcc ccnttaacac aacaatatcg nataagatta 120 

55 cgctntgctt ctacnacaan ttt 143 
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<210> 50 
<211> 47 
<212> PRT 

<213> Bacillus thuringiensis 



<220> 

<:221> SITE 
<222> (3) 

<223> X = R, Ir K, or T 



<220> 

<221> SITE 
<222> (17) 

<223> X = A, D, G or V 



<220> 

<221> SITE 
<222> (25) 
<223> X = S, or R 



<220> 

<221> SITE 
<222> (27) 

<223> X = R, K, M, or T 

<220> 

<221> SITE 
<222> (29) 

<223> X = N, I, S, or T 



<220> 

<221> SITE 
<222> (42) 

<223> X = C, F, S, or Y 



<220> 

<221> SITE 
<222> (47) 

<223> X - N, I, S, or T 
<400> SO 

Val Val Xaa Gly Pro Gly Phe Thr 
1 5 

Xaa Gly Gly Ala Phe Gly Thr lie 
20 

Thr Gin Gin Tyr Arg lie Arg Leu 
35 40 



Gly Gly Asp lie Leu Arg Arg Thr 
10 15 

Xaa Ala Xaa Ala Xaa Ala Pro Leu 
25 30 

Arg Xaa Ala Ser Thr Thr Xaa 
45 



<210> 51 

<211> 42 

<212> DNA 

<213> Artificial 



Sequence 



<220> 
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<223> Description of Artificial Sequence: Primer 
<400> 51 

tggatacttg atcaatatga taatccgtca catctgtttt ta 42 



<210> 52 
<211> 61 
<212> DNA 
10 <213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
15 <400> 52 

agtaacggtg ttactattag cgagggcggt ccattcttta aggtcgtgca cttcagttag 60 
c 



20 <210> 53 
<211> 22 
<212> DNA 

<213> Artificial Sec[uence 
25 <220> 

<223> Description of Artificial Sequence: primer 
<400> 53 

cgacttctcc tgctaatgga gg 

30 

<210> 54 
<211> 28 
<212> DNA 
35 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 

40 <400> 54 

ctcgctaata gtaacaccgt tacttgcc 



<210> 55 
45 <211> 61 
' <212> DNA 
<213> Artificial Sequence 

<220> 

50 <223> Description of Artificial Sequence: primer 

<400> 55 

atttagtagc atgcgttgca ctttgtgcat tttttcataa gatgagtcat atgttttaaa 60 
t ^1 

55 

<210> 56 
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<211> 23 
<212> DNA 

<213> Artificial Sequence 

5 <220> 

<223> Description of Artificial Sequence: primer 
<400> 56 

ggatagcact catcaaaggt acc 

10 

<210> 57 
<211> 22 
<212> DNA 
I 15 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 

20 <400> 57 

gtwtggacsc rtcghgatgt gg 



<210> 58 
25 <211> 40 
<212> DNA 

<213> Artificial Sequence 



<220> 

30 <223> Description of Artificial Sequence: primer 
<400> 58 

taatttctgc tagcccwatt tctggattta attgttgatc 



<210> 59 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

40 

<220> 

<221> modif ied_base 
<222> (3).. (12) 
<223> W = A. T 

45 

<220> 

<22l> tnodif ied_base 
<222> (6) 

<223> N = A, C, T 

50 

<220> 

<22l> modif ied_base 
<222> (10) 
<223> M = A, C 

55 

<220> 

<221> modified base 
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<222> (18) 
<223> R s A, 6 

<220> 

5 <221> modif ied_base 
<222> (15) 
<223> D = A, G 

<220> 

10 <223> Description of Artificial Sequence: primer 
<400> 59 

atwacncaam twccdttrg 19 

IS 

<210> 60 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: primer 
<400> 60 

aatgcagatg aatgggg 17 

25 ■ , 

<210> 61 
<211> 17 
<212> DNA 
30 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 

35 <400> 61 

tgataatgga gctcgtt 



<210> 62 
40 <211> 3684 
<212> DNA 

<213> Bacillus thuringiensis 
<400> 62 

ttgacttcaa ataggaaaaa tgagaatgaa 
45 tcgaatcatt ccgcacaaat gaatctatca 
atagccgagg ggaacaatat cgatccattt 
aacatagctg gtagaatact aggtgtatta 
ttttatagtt ttcttgttgg tgaattatgg 
ctagaacatg tcgaacatct tataagacaa 
50 cttgctcgat tacaaggttt aggaaattcc 
tggctagaaa accgtgatga tgcaagaacg 
ttagaacttg attttcttaa tgcgatgccg 
ttattaatgg tatatgctca agctgcaaat 
ctttttggta gtgaatttgg gcttacatcc 
55 gtggaaaaaa cgagagaata ttctgattat 
aatttgagag ggacaaatgc tgaaagttgg 
acgctaggag tattagatct agtggcacta 



attataaatg ctttatcgat tccagctgta 60 
accgatgctc gtattgagga tagcttgtgt 120 
gttagcgcat caacagtcca aacgggtatt 18 0 
ggcgtaccgt ttgctggaca aatagctagt 240 
ccccgcggca gagatccttg ggaaattttc 300 
caagtaacag aaaatactag ggatacggct 360 
tttagagcct atcaacagtc acttgaagat 420 
agaagtgttc tttataccca atatatagcc 480 
cttttcgcaa ttagaaacca agaagttcca 540 
ttacacctat tattattgag agatgcctct 600 
caagaaattc aacgttatta tgagcgccaa 660 
tgcgcaagat ggtataatac gggtttaaat 720 
ttgcgatata atcaattccg tagagactta 780 
ttcccaagct atgacacgcg tgtttatcca 840 
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atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900 
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960 
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020 
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080 
5 cttgaatcgc gaacaataag ggggtcatta agtacctgga cacacggaaa taccaatact 1140 
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200 
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260 
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320 
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380 
10 aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440 
agagcadcag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500 
gatagcataa cacaaatacc attggtaaaa tcattcaacc ttaattcagg tacctctgta 1560 
gtcagtggcc caggatttac aggaggggat ataatccgaa ctaacgttaa tggtagtgta 1620 
ctaagtatgg gtcttaattt taataataca tcattacagc ggtatcgcgt gagagttcgt 1680 
)l5 tatgctgctt ctcaaacaat ggtcctgagg gtaactgtcg gagggagtac tacttttgat 1740 
caaggattcc ctagtactat gagtgcaaat gagtctttga catotcaatc atttagattt 1800 
gcagaatttc ctgtaggtat tagtgcatct ggcagtcaaa ctgctggaat aagtataagt 1860 
aataatgcag gtagacaaac gtttcacttt gataaaattg aattcattcc aattactgca 1920 
accttcgaag cagaatatga tttagaaaga gcgcaagagg cggtgaatgc tctgtttact 1980 
20 aatacgaatc caagaaggtt gaaaacaggt gtgacagatt atcatattga tgaagtatcc 2040 
aatttagtgg cgtgtttatc ggatgaattc tgcttggatg aaaagagaga attacttgag 2100 
aaagtgaaat atgcgaaacg actcagtgat gaaagaaact tactccaaga tccaaacttc 2160 
acatccatca ataagcaacc agacttcaat tctaataatg agcaatcgaa tttcacatct 2220 
atccatgaac aatctgaaca tggatggtgg ggaagtgaga acattacaat ccaggaagga 2280 
25 aatgacgtat ttaaagagaa ttacgtcaca ctaccgggta cttttaatga gtgttatccg 2340 
acgtatttat atcaaaaaat aggggaggcg gaattaaaag cttatactcg ctaccaatta 2400 
agtggctata ttgaagatag tcaagattta gagatatatt tgattcgtta caatgcgaaa 2460 
catgaaacat tggatgttcc aggtaccgag tccgtatggc cgctttcagt tgaaagccca 2520 
atcggaaggt gcggagaacc gaatcgatgc gcaccacatt ttgaatggaa tcctgatcta 258 0 
30 gattgttcct gcagagatgg agaaaaatgt gcgcatcatt cccatcattt ctctttggat 2640 
attgatgttg gatgcataga cttgcatgag aacctaggcg tgtgggtggt attcaagatt 2700 
aagacgcagg aaggtcatgc aagactaggg aacctggaat ttattgaaga gaaaccatta 2760 
ttaggagaag cactgtctcg tgtgaagaga gcagagaaaa aatggagaga caaacgtgaa 2820 
aaactacaat tggaaacaaa acgagtatat acagaggcaa aagaagctgt ggatgcttta 2880 
35 tttgtagatt ctcaatatga tagattacaa gcggatacaa acattggcat gattcatgcg 2940 
' gcagataaac ttgttcatcg aattcgagag gcgtatcttt cagaattatc tgttatccca 3000 
ggtgtaaatg cggaaatttt tgaagaatta gaaggtcgca ttatcactgc aatctcccta 306 0 
tacgatgcga gaaatgtcgt taaaaatggt gattttaata atggattagc atgctggaat 3120 
gtaaaagggc atgtagatgt acaacagagc catcaccgtt ctgtccttgt tatcccagaa 3180 
40 tgggaagcag aagtgtcaca agcagttcgc gtctgtccgg ggcgtggcta tatcctccgt 3240 
gtcacagcgt acaaagaggg atatggagag ggttgtgtaa ctatccatga aatcgagaac 3300 
aatacagacg aactaaaatt taaaaactgt gaagaagagg aagtgtatcc aacggataca 3360 
ggaacgtgta atgattatac tgcacaccaa ggtacagcag tatgtaattc ccgtaatgct 3420 
ggatatgagg atgcatatga agttgatact acagcatctg ttaattacaa accgacttat 3480 
45 gaagaagaaa cgtatacaga tgtacgaaga gataatcatt gtgaatatga cagagggtat 3540 
gtgaattatc caocagtacc agctggttat atgacaaaag aattagaata cttcccagaa 3600 
accgataagg tatggattga gattggagaa acggaaggga agtttattgt agacagcgtg 3660 
gaattactcc ttatggagga atag 

SO 

<210> 63 
<211> 1227 
<212> PRT 

<213> Bacillus thuringiensis 

55 <400> 63 , , c. 

Leu Thr Ser Asn Arg Lys Asn Glu Asn Glu lie lie Asn Ala Leu ser 
15 10 15 
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He Pro Ala Val Ser Asn His Ser Ala Gin Met Asn Leu Ser Thr Asp 
20 25 30 

5 Ala Arg He Glu Asp Ser Leu Cys He Ala Glu Gly Asn Asn He Asp 
35 40 45 

Pro Phe Val Ser Ala Ser Thr Val Gin Thr Gly He Asn He Ala Gly 
50 55 60 

10 

Arg He Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gin He Ala Ser 
65 70 75 80 

Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro 
15 85 90 95 

Trp Glu He Phe Leu Glu His Val Glu His Leu He Arg Gin Gin Val 
100 105 110 

20 Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gin Gly Leu Gly 
115 120 125 

Asn Ser Phe Arg Ala Tyr Gin Gin Ser Leu Glu Asp Trp Leu Glu Asn 
130 135 140 

25 

Arg Asp Asp Ala Arg Thr Arg Ser Val lieu Tyr Thr Gin Tyr He Ala 
145 150 155 160 

Leu Glu Leu Asp Phe Leu Asn Ala Met: Pro Leu Phe Ala He Arg Asn 
30 165 170 175 

Gin Glu Val Pro Leu Leu Met Val Tyr Ala Gin Ala Ala Asn Leu His 
180 185 190 

35 Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu 
195 200 205 

Thr Ser Gin Glu He Gin Arg Tyr Tyr Glu Arg Gin Val Glu Lys Thr 
210 215 220 

40 

Arg Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn 
225 230 235 240 

Asn Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gin Phe 
45 245 250 255 

Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro 
260 265 270 

50 Ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gin Leu Thr 
275 280 285 

Arg Glu He Tyr Thr Asp Pro He Gly Arg Thr Asn Ala Pro Ser Gly 
290 295 300 

55 

Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala 
305 310 315 320 
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Ile Glu Ala Ala Val lie Arg Pro Pro His Leu Leu Asp Phe Pro Glu 
325 330 335 

5 Gin Leu Thr lie Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gin Tyr 
340 345 350 

Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr He Arg Gly 
355 360 365 

10 

ser Leu Ser Thr Trp Thr His Gly Asn Thr Asn Thr Ser He Asn Pro 
370 375 380 

Val Thr Leu Gin Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe 
;i5 385 390 395 400 

Ala Gly He Asn He Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp 
405 410 415 

20 Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu 
420 425 430 

Leu Tyr Thr He Gly Tyr Thr Gly Val Gly Thr Gin Leu Phe Asp Ser 
435 440 445 

25 

Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser 

450 455 460 

Tyr Ser His Arg Leu Ser Asn He Arg Leu He Ser Gly Asn Thr Leu 
30 465 470 475 480 

Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn 
485 490 495 

35 Thr He Ser Ser Asp Ser He Thr Gin He Pro Leu Val Lys Ser Phe 
500 505 510 

Asn Leu Asn Ser Gly Thr Ser Val Val Ser Gly Pro Gly Phe Thr Gly 
515 520 525 

40 

Gly Asp He He Arg Thr Asn Val Asn Gly Ser Val Leu Ser Met Gly 
530 535 540 

Leu Asn Phe Asn Asn Thr Ser Leu Gin Arg Tyr Arg Val Arg Val Arg 
45 545 550 555 560 

Tyr Ala Ala Ser Gin Thr Met Val Leu Arg Val Thr Val Gly Gly Ser 

565 570 575 

50 Thr Thr Phe Asp Gin Gly Phe Pro Ser Thr Met Ser Ala Asn Glu Ser 
580 585 590 

Leu Thr Ser Gin Ser Phe Arg Phe Ala Glu Phe Pro Val Gly He Ser 
595 600 605 

55 

Ala Ser Gly Ser Gin Thr Ala Gly He Ser He Ser Asn Asn Ala Gly 
610 615 620 
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Arg Gin Thr Phe His Phe Asp Lys lie Glu Phe lie Pro He Thr Ala 
625 630 635 640 

5 Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Glu Ala Val Asn 

645 650 655 

Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Gly Val Thr 
660 665 670 

10 

Asp Tyr His He Asp Glu Val Ser Asn Leu Val Ala Cys Leu Ser Asp 
675 680 685 

Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys Tyr 
IS 690 695 700 

Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn Phe 
705 710 715 720 

20 Thr Ser He Asn Lys Gin Pro Asp Phe Asn Ser Asn Asn Glu Gin Ser 

725 730 735 

Asn Phe Thr Ser He. His Glu Gin Ser Glu His Gly Trp Trp Gly Ser 
740 745 750 

25 

Glu Asn He Thr He Gin Glu Gly Asn Asp Val Phe Lys Glu Asn Tyr 

755 760 765 

Val Thr Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu Tyr 
30 770 775 780 

Gin Lys He Gly Glu Ala Glu Leu Lys Ala Tyr Thr Arg Tyr Gin Leu 
785 790 795 800 

35 Ser Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg 

805 810 815 

Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser Val 
820 825 830 

40 

Trp Pro Leu Ser Val Glu Ser Pro He Gly Arg Cys Gly Glu Pro Asn 
835 840 845 

Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys 
45 850 855 860 

Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp 
865 870 875 880 

50 He Asp Val Gly Cys He Asp Leu His Glu Asn Leu Gly Val Trp Val 

885 890 895 

Val Phe Lys He Lys Thr Gin Glu Gly His Ala Arg Leu Gly Asn Leu 
900 905 910 

55 

Glu Phe He Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg Val 
915 920 925 
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Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin Leu 
930 535 940 

5 Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala Leu 
945 950 955 960 

Phe Val Asp Ser Gin Tyr Asp Arg Leu Gin Ala Asp Thr Asn He Gly 
965 970 975 

10 

Met He His Ala Ala Asp Lys Leu Val His Arg He Arg Glu Ala Tyr 
980 985 990 

Leu Ser Glu Leu Ser Val He Pro Gly Val Asn Ala Glu He Phe Glu 
) ,5 1000 1005 

Glu Leu Glu Gly Arg He He Thr Ala He Ser Leu Tyr Asp Ala Arg 
1010 1015 1020 

20 Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala Cys Trp Asn 
1025 1030 1035 1040 

val Lys Gly His Val Asp V^al Gin Gin Ser His His Arg Ser Val Leu 
1045 1050 1055 

val He Pro Glu Trp Glu Ala Glu Val Ser Gin Ala Val Arg Val Cys 
1060 1065 1070 

Pro Gly Arg Gly Tyr He Leu Arg val Thr Ala Tyr Lys Glu Gly Tyr 
30 1075 1080 1085 

Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp Glu 
1090 1095 1100 

\ 35 Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp Thr 
^ 1105 1110 1115 1120 

Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Val Cys Asn 
1125 1130 1135 

Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr Ala 
1140 1145 1150 

Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp Val 
45 1155 1160 1165 

Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr Pro 
1170 1175 1180 

50 Pro Val Pro Ala Gly Tyr Met Thr Lys Glu Leu Glu Tyr Phe Pro Glu 
1185 1190 1195 1200 

Thr Asp Lys Val Trp He Glu He Gly Glu Thr Glu Gly Lys Phe He 
1205 1210 1215 

55 

Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1220 1225 



